[ ] in the declaration; 

[ ] a certified copy of the priority document; 

[ ] a General Authorization for Petitions for Extensions of Time and Payment of Fees; 

[ ] statement(s) claiming small entity status; 

[ ] an Assignment document; 

[X] an Information Disclosure Statement; and 
Basic Application Fee 
Total Application Fee 
P.O. Box 1404 
Assistant Commissioner for Patents 
Washington, D.C. 20231 

Sir: 

This Application is a continuation-in-part of U.S. Application Serial No. 
09/229,319, filed January 13, 1999, which claims benefit of U.S. Provisional Application 
No. 60/071,449, filed January 13, 1998, and U.S. Provisional Application No. 
60/105,511, filed October 23, 1998. 

Prior to examination on the merits, please amend the above-identified Application 
REMARKS 

Applicants retain the right to reintroduce any subject matter which was canceled 
without prejudice by the present Preliminary Amendment at any time during the 
prosecution of this Application or any continuation or divisional application thereof in the 
United States. 

In the event that there are any questions relating to this Preliminary Amendment, or 
the Application in general, it would be appreciated if the Examiner would telephone the 
undersigned attorney concerning such questions so that prosecution of this Application may 
be expedited. 
THE HIGH BONE MASS GENE OF 11q13.3 
RELATED APPLICATIONS 

This application is a continuation-in-part of Application No. 09/229,319, filed January 
13, 1999, which claims benefit of U.S. Provisional Application No. 60/071,449, filed January 
13, 1998, and U.S. Provisional Application No. 60/105,511, filed October 23, 1998, all of 
which are herein incorporated by reference in their entirety. 

FIELD OF THE INVENTION 

The present invention relates generally to the field of genetics, genomics and 
molecular biology. More particularly, the invention relates to methods and materials used to 
isolate, detect and sequence a high bone mass gene and corresponding wild-type gene, and 
mutants thereof. The present invention also relates to the high bone mass gene, the 
corresponding wild-type gene, and mutants thereof. The genes identified in the present 
invention are implicated in the ontology and physiology of bone development. The invention 
also provides nucleic acids, proteins, cloning vectors, expression vectors, transformed hosts, 
methods of developing pharmaceutical compositions, methods of identifying molecules 
involved in bone development, and methods of diagnosing and treating diseases involved in 
bone development. In preferred embodiments, the present invention is directed to methods 
for treating, diagnosing, preventing and screening for normal and abnormal conditions of 
bone, including metabolic bone diseases such as osteoporosis. 

BACKGROUND OF THE INVENTION 

Two of the most common types of osteoporosis are postmenopausal and senile 
osteoporosis. Osteoporosis affects men as well as women, and, taken with other 
abnormalities of bone, presents an ever-increasing health risk for an aging population. The 
most common type of osteoporosis is that associated with menopause. Most women lose 
between 20% and 60% of the bone mass in the trabecular compartment of the bone within 3-6 years 
after the cessation of menses. This rapid loss is generally associated with an increase of bone 
resorption and formation. However, the resorptive cycle is more dominant and the result is a 
net loss of bone mass. Osteoporosis is a common and serious disease among postmenopausal 
women. There are an estimated 25 million women in the United States alone who are 
afflicted with this disease. The results of osteoporosis are both personally harmful and also 
account for a large economic loss due to its chronicity and the need for extensive and long-term 
support (hospitalization and nursing home care) from the disease sequelae. This is 
especially true in more elderly patients. Additionally, while osteoporosis is generally not 
thought of as a life-threatening condition, a 20-30% mortality rate is related to hip fractures in 
elderly women. A large percentage of this mortality rate can be directly associated with 
postmenopausal osteoporosis. 

The most vulnerable tissue in the bone to the effects of postmenopausal osteoporosis 
is the trabecular bone. This tissue is often referred to as spongy bone and is particularly 
concentrated near the ends of the bone near the joints and in the vertebrae of the spine. The 
trabecular tissue is characterized by small structures which inter-connect with each other as 
well as with the more solid and dense cortical tissue which makes up the outer surface and central 
shaft of the bone. This criss-cross network of trabeculae gives lateral support to the outer 
cortical structure and is critical to the biomechanical strength of the overall structure. In 
postmenopausal osteoporosis, it is primarily the net resorption and loss of the trabeculae 
which lead to the failure and fracture of the bone. In light of the loss of the trabeculae in 
postmenopausal women, it is not surprising that the most common fractures are those 
associated with bones which are highly dependent on trabecular support, e.g., the vertebrae, 
the neck of the femur, and the forearm. Indeed, hip fracture, Colles' fractures, and vertebral 
crush fractures are indicative of postmenopausal osteoporosis. 

One of the earliest generally accepted methods for treatment of postmenopausal 
osteoporosis was estrogen replacement therapy. Although this therapy frequently is 
successful, patient compliance is low, primarily due to the undesirable side-effects of chronic 
estrogen treatment. Frequently cited side-effects of estrogen replacement therapy include 
reinitiation of menses, bloating, depression, and fear of breast or uterine cancer. In order to 
limit the known threat of uterine cancer in those women who have not undergone a 
hysterectomy, a protocol of estrogen and progestin cyclic therapy is often employed. This 
protocol is similar to that which is used in birth control regimens, and often is not tolerated by 
women because of the side-effects characteristic of progestin. More recently, certain 
antiestrogens, originally developed for the treatment of breast cancer, have been shown in 
experimental models of postmenopausal osteoporosis to be efficacious. Among these agents 
is raloxifene (See, U.S. Patent No. 5,393,763, and Black et al, J. Clin. Invest., 93:63-69 
(1994)). In addition, tamoxifen, a widely used clinical agent for the treatment of breast 
cancer, has been shown to increase bone mineral density in postmenopausal women suffering 
from breast cancer (Love et al, N. Engl. J. Med., 326:852-856 (1992)). 

Another therapy for the treatment of postmenopausal osteoporosis is the use of 
calcitonin. Calcitonin is a naturally occurring peptide which inhibits bone resorption and has 
been approved for this use in many countries (Overgaard et al, Br. Med. J., 305:556-561 
(1992)). The use of calcitonin has been somewhat limited, however. Its effects are very 
modest in increasing bone mineral density and the treatment is very expensive. Another 
therapy for the treatment of postmenopausal osteoporosis is the use of bis-phosphonates. 
These compounds were originally developed for use in Paget's disease and malignant 
hypercalcemia. They have been shown to inhibit bone resorption. Alendronate, one 
compound of this class, has been approved for the treatment of postmenopausal osteoporosis. 
These agents may be helpful in the treatment of osteoporosis, but these agents also have 
potential liabilities which include osteomalacia, extremely long half-life in bone (greater than 
2 years), and possible "frozen bone syndrome," i.e., the cessation of normal bone remodeling. 

Senile osteoporosis is similar to postmenopausal osteoporosis in that it is marked by 
the loss of bone mineral density and resulting increase in fracture rate, morbidity, and 
associated mortality. Generally, it occurs in later life, i.e., after 70 years of age. Historically, 
senile osteoporosis has been more common in females, but with the advent of a more elderly 
male population, this disease is becoming a major factor in the health of both sexes. It is not 
clear what, if any, role hormones such as testosterone or estrogen have in this disease, and its 
etiology remains obscure. Treatment of this disease has not been very satisfactory. Hormone 
therapy, estrogen in women and testosterone in men, has shown equivocal results; calcitonin 
and bis-phosphonates may be of some utility. 

The peak mass of the skeleton at maturity is largely under genetic control. Twin 
studies have shown that the variance in bone mass between adult monozygotic twins is 
smaller than between dizygotic twins (Slemenda et al, J. Bone Miner. Res., 6:561-567 (1991); 
Young et al, J. Bone Miner. Res., 6:561-567 (1995); Pocock et al, J. Clin. Invest., 80:706-710 
(1987); Kelly et al, J. Bone Miner. Res., 8:11-17 (1993)), and it has been estimated that up to 
60% or more of the variance in skeletal mass is inherited (Krall et al, J. Bone Miner. Res., 
10:S367 (1993)). Peak skeletal mass is the most powerful determinant of bone mass in 
elderly years (Hui et al, Ann. Int. Med., 111:355-361 (1989)), even though the rate of age-related 
bone loss in adult and later life is also a strong determinant (Hui et al, Osteoporosis 
Int., 1:30-34 (1995)). Since bone mass is the principal measurable determinant of fracture 
risk, the inherited peak skeletal mass achieved at maturity is an important determinant of an 
individual's risk of fracture later in life. Thus, study of the genetic basis of bone mass is of 
considerable interest in the etiology of fractures due to osteoporosis. 

Recently, a strong interest in the genetic control of peak bone mass has developed in 
the field of osteoporosis. The interest has focused mainly on candidate genes with suitable 
polymorphisms to test for association with variation in bone mass within the normal range, or 
has focused on examination of genes and gene loci associated with low bone mass in the 
range found in patients with osteoporosis. The vitamin D receptor locus (VDR) (Morrison et 
al, Nature, 367:284-287 (1994)), PTH gene (Howard et al, J. Clin. Endocrinol. Metab., 
80:2800-2805 (1995); Johnson et al, J. Bone Miner. Res., 8:11-17 (1995); Gong et al, J. Bone 
Miner. Res., 10:S462 (1995)) and the estrogen receptor gene (Hosoi et al, J. Bone Miner. 
Res., 10:S170 (1995); Morrison et al, Nature, 367:284-287 (1994)) have figured most 
prominently in this work. These studies are difficult because bone mass (the phenotype) is a 
continuous, quantitative, polygenic trait, and is confounded by environmental factors such as 
nutrition, co-morbid disease, age, physical activity, and other factors. Also, this type of study 
design requires large numbers of subjects. In particular, the results of VDR studies to date 
have been confusing and contradictory (Garnero et al, J. Bone Miner. Res., 10:1283-1288 
(1995); Eisman et al, J. Bone Miner. Res., 10:1289-1293 (1995); Peacock, J. Bone Miner. 
Res., 10:1294-1297 (1995)). Furthermore, the work thus far has not shed much light on the 
mechanism(s) whereby the genetic influences might exert their effect on bone mass. 

While it is well known that peak bone mass is largely determined by genetic rather 
than environmental factors, studies to determine the gene loci (and ultimately the genes) 
linked to variation in bone mass are difficult and expensive. Study designs which utilize the 
power of linkage analysis, e.g., sib-pair or extended family, are generally more informative 
than simple association studies, although the latter do have value. However, genetic linkage 
studies involving bone mass are hampered by two major problems. The first problem is the 
phenotype, as discussed briefly above. Bone mass is a continuous, quantitative trait, and 
establishing a discrete phenotype is difficult. Each anatomical site for measurement may be 
influenced by several genes, many of which may be different from site to site. The second 
problem is the age component of the phenotype. By the time an individual can be identified 
as having low bone mass, there is a high probability that their parents or other members of 
prior generations will be deceased and therefore unavailable for study, and younger 
generations may not have even reached peak bone mass, making their phenotyping uncertain 
for genetic analysis. 

Regardless, linkage analysis can be used to find the location of a gene causing a 
hereditary "disorder" and does not require any knowledge of the biochemical nature of the 
disorder, i.e., a mutated protein that is believed to cause the disorder does not need to be 
known. Traditional approaches depend on assumptions concerning the disease process that 
might implicate a known protein as a candidate to be evaluated. The genetic localization 
approach using linkage analysis can be used to first find the general chromosomal region in 
which the defective gene is located and then to gradually reduce the size of the region in order 
to determine the location of the specific mutated gene as precisely as possible. After the gene 
itself is discovered within the candidate region, the messenger RNA and the protein are 
identified and, along with the DNA, are checked for mutations. 

The genetic localization approach has practical implications since the location of the 
disease can be used for prenatal diagnosis even before the altered gene that causes the disease 
is found. Linkage analysis can enable families, even many of those that do not have a sick 
child, to know whether they are carriers of a disease gene and to evaluate the condition of an 
unborn child through molecular diagnosis. The transmission of a disease within families, 
then, can be used to find the defective gene. As used herein, reference to "high bone mass" 
(HBM) is analogous to reference to a disease state, although from a practical standpoint high 
bone mass can actually help a subject avoid the disease known as osteoporosis. 

Linkage analysis is possible because of the nature of inheritance of chromosomes 
from parents to offspring. During meiosis, the two parental homologues pair to guide their 
proper separation to daughter cells. While they are lined up and paired, the two homologues 
exchange pieces of the chromosomes, in an event called "crossing over" or "recombination." 
The resulting chromosomes are chimeric, that is, they contain parts that originate from both 
parental homologues. The closer together two sequences are on the chromosome, the less 
likely that a recombination event will occur between them, and the more closely linked they 
are. In a linkage analysis experiment, two positions on the chromosomes are followed from 
one generation to the next to determine the frequency of recombination between them. In a 
study of an inherited disease, one of the chromosomal positions is marked by the disease gene 
or its normal counterpart, i.e., the inheritance of the chromosomal region can be determined 
by examining whether the individual displays symptoms of the disorder or not. The other 
position is marked by a DNA sequence that shows natural variation in the population such 
that the two homologues can be distinguished based on the copy of the "marker" sequence 
that they possess. In every family, the inheritance of the genetic marker sequence is 
compared to the inheritance of the disease state. If, within a family carrying an autosomal 
dominant disorder such as high bone mass, every affected individual carries the same form of 
the marker and all the unaffected individuals carry at least one different form of the marker, 
there is a great probability that the disease gene and the marker are located close to each 
other. In this way, chromosomes may be systematically checked with known markers and 
compared to the disease state. The data obtained from the different families are combined and 
analyzed together by a computer using statistical methods. The result is information 
indicating the probability of linkage between the genetic marker and the disease gene, allowing 
for different distances between them. A positive result can mean that the disease gene is very close to 
the marker, while a negative result indicates that it is far away on that chromosome, or on an 
entirely different chromosome. 

Linkage analysis is performed by typing all members of the affected family at a given 
marker locus and evaluating the co-inheritance of a particular disease state with the marker 
probe, thereby determining how often the two of them are co-inherited. The recombination 
frequency can be used as a measure of the genetic distance between two gene loci. A 
recombination frequency of 1% is equivalent to 1 map unit, or 1 centiMorgan (cM), which is 
roughly equivalent to 1,000 kb of DNA. This relationship holds up to frequencies of about 
20% or 20 cM. 
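
By way of illustration only, the conversion described above may be sketched as follows. The function names and the example counts are hypothetical; the sketch assumes that recombinant and non-recombinant meioses can be counted directly, and it applies the 1% = 1 cM rule only within the approximately 20% range stated above.

```python
def recombination_fraction(recombinants: int, informative_meioses: int) -> float:
    """Fraction of informative meioses in which marker and trait recombined."""
    if informative_meioses <= 0:
        raise ValueError("at least one informative meiosis is required")
    if not 0 <= recombinants <= informative_meioses:
        raise ValueError("recombinants must lie between 0 and the number of meioses")
    return recombinants / informative_meioses


def map_distance_cm(theta: float) -> float:
    """Convert a recombination fraction to centiMorgans (1% = 1 cM).

    The linear rule is applied only up to about 20%, as stated in the text.
    """
    if not 0.0 <= theta <= 0.20:
        raise ValueError("linear approximation is stated to hold only up to ~20%")
    return theta * 100.0


def approx_physical_kb(distance_cm: float) -> float:
    """Rough physical distance, using about 1,000 kb per cM as stated in the text."""
    return distance_cm * 1000.0


if __name__ == "__main__":
    theta = recombination_fraction(recombinants=3, informative_meioses=60)
    cm = map_distance_cm(theta)
    print(f"theta = {theta:.3f}, about {cm:.1f} cM, about {approx_physical_kb(cm):.0f} kb")
```

The range check reflects the statement that the proportionality between recombination frequency and map distance is only approximate at larger distances.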

The entire human genome is 3,300 cM long. In order to find an unknown disease 
gene within 5-10 cM of a marker locus, the whole human genome can be searched with 
roughly 330 informative marker loci spaced at approximately 10 cM intervals (Botstein et al, 
Am. J. Hum. Genet., 32:314-331 (1980)). The reliability of linkage results is established by 
using a number of statistical methods. The method most commonly used for the analysis of 
linkage in humans is the LOD score method (Morton, Prog. Clin. Biol. Res., 147:245-265 
(1984), Morton et al, Am. J. Hum. Genet., 38:868-883 (1986)) which was incorporated into 
the computer program LIPED by Ott, Am. J. Hum. Genet., 28:528-529 (1976). LOD scores 
are the logarithm (base 10) of the ratio of the likelihood that two loci are linked at a given distance to 
the likelihood that they are not linked (>50 cM apart). The advantage of using logarithmic values is that 
they can be summed among families with the same disease. This becomes necessary given 
the relatively small size of human families. 
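
As a minimal sketch of the LOD score definition given above, the following computes a two-point LOD score for the simplest case, assuming that every meiosis is fully informative and phase-known so that recombinants can be counted directly. This is an illustrative simplification and is not the algorithm of the LIPED program; the function name and the example counts are hypothetical.

```python
import math


def two_point_lod(recombinants: int, nonrecombinants: int, theta: float) -> float:
    """log10 of L(theta) / L(0.5) for phase-known, fully informative meioses.

    L(theta) = theta**R * (1 - theta)**NR, and the unlinked hypothesis
    corresponds to theta = 0.5. The computation is done in log space.
    """
    if recombinants < 0 or nonrecombinants < 0:
        raise ValueError("counts must be non-negative")
    if not 0.0 < theta <= 0.5:
        raise ValueError("theta must satisfy 0 < theta <= 0.5")
    total = recombinants + nonrecombinants
    log_linked = recombinants * math.log10(theta) + nonrecombinants * math.log10(1.0 - theta)
    log_unlinked = total * math.log10(0.5)
    return log_linked - log_unlinked


if __name__ == "__main__":
    # Hypothetical family: 1 recombinant among 10 informative meioses.
    for theta in (0.01, 0.05, 0.10, 0.20, 0.50):
        print(f"theta = {theta:.2f}  LOD = {two_point_lod(1, 9, theta):.3f}")
```

Evaluating the score over a range of recombination fractions, as in the loop above, mirrors the practice of reporting the LOD score at each specified recombination frequency.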

By convention, a total LOD score greater than +3.0 (that is, odds of linkage at the 
specified recombination frequency being 1000 times greater than odds of no linkage) is 
considered to be significant evidence for linkage at that particular recombination frequency. 
A total LOD score of less than -2.0 (that is, odds of no linkage being 100 times greater than 
odds of linkage at the specified frequency) is considered to be strong evidence that the two 
loci under consideration are not linked at that particular recombination frequency. Until 
recently, most linkage analyses have been performed on the basis of two-point data, which is 
the relationship between the disorder under consideration and a particular genetic marker. 
However, as a result of the rapid advances in mapping the human genome over the last few 
years, and concomitant improvements in computer methodology, it has become feasible to 
carry out linkage analyses using multi-point data. Multi-point analysis provides a 
simultaneous analysis of linkage between the disease and several linked genetic markers, 
when the recombination distance among the markers is known. 
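
The conventional thresholds described above may be illustrated by the following sketch, which sums per-family LOD scores computed at the same recombination frequency and classifies the total. The function name, the labels, and the per-family values are hypothetical and serve only as an example.

```python
from typing import Iterable, Tuple


def classify_total_lod(family_lods: Iterable[float]) -> Tuple[float, str]:
    """Sum per-family LOD scores (same theta) and apply the +3.0 / -2.0 convention."""
    total = sum(family_lods)
    if total > 3.0:
        verdict = "significant evidence for linkage"
    elif total < -2.0:
        verdict = "strong evidence against linkage"
    else:
        verdict = "inconclusive"
    return total, verdict


if __name__ == "__main__":
    # Hypothetical LOD scores from three small families at one theta.
    total, verdict = classify_total_lod([1.2, 0.9, 1.1])
    # A LOD of 3.0 corresponds to odds of 10**3 = 1000 to 1 in favor of linkage.
    print(f"total LOD = {total:.2f}: {verdict}")
```

Because the scores are logarithms, adding them across families is equivalent to multiplying the underlying odds, which is why small families can be pooled in this manner.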

Multi-point analysis is advantageous for two reasons. First, the informativeness of the 
pedigree is usually increased. Each pedigree has a certain amount of potential information, 
dependent on the number of parents heterozygous for the marker loci and the number of 
affected individuals in the family. However, few markers are sufficiently polymorphic as to 
be informative in all those individuals. If multiple markers are considered simultaneously, 
then the probability of an individual being heterozygous for at least one of the markers is 
greatly increased. Second, an indication of the position of the disease gene among the 
markers may be determined. This allows identification of flanking markers, and thus 
eventually allows isolation of a small region in which the disease gene resides. Lathrop et al, 
Proc. Natl. Acad. Sci. USA, 81:3443-3446 (1984) have written the most widely used 
computer package, LINKAGE, for multi-point analysis. 
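
The first advantage noted above can be made concrete with a short calculation. The sketch below assumes, for illustration only, that heterozygosity at each marker is independent of the others; the heterozygosity values and the function name are hypothetical.

```python
from typing import Sequence


def prob_heterozygous_at_any(heterozygosities: Sequence[float]) -> float:
    """Probability of being heterozygous at one or more markers.

    Assumes independence between markers (an illustrative simplification):
    P(at least one) = 1 - product over markers of (1 - h_i).
    """
    prob_none = 1.0
    for h in heterozygosities:
        if not 0.0 <= h <= 1.0:
            raise ValueError("heterozygosity must be between 0 and 1")
        prob_none *= 1.0 - h
    return 1.0 - prob_none


if __name__ == "__main__":
    # One marker versus three markers, each with hypothetical heterozygosity 0.7.
    print(f"one marker:    {prob_heterozygous_at_any([0.7]):.3f}")
    print(f"three markers: {prob_heterozygous_at_any([0.7, 0.7, 0.7]):.3f}")
```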

There is a need in the art for identifying the gene associated with a high bone mass 
phenotype. The present invention is directed to this, as well as other, important ends. 

SUMMARY OF THE INVENTION 

The present invention describes the Zmax1 gene and the HBM gene on chromosome 
11q13.3 by genetic linkage and mutation analysis. The use of additional genetic markers 
linked to the genes has aided this discovery. By using linkage analysis and mutation analysis, 
persons predisposed to HBM may be readily identified. Cloning methods using Bacterial 
Artificial Chromosomes have enabled the inventors to focus on the chromosome region of 
11q13.3 and to accelerate the sequencing of the autosomal dominant gene. In addition, the 
invention identifies the Zmax1 gene and the HBM gene, and identifies the guanine-to-thymine 
polymorphism mutation at position 582 in the Zmax1 gene that produces the HBM 
gene and the HBM phenotype. 

The present invention identifies the Zmax1 gene and the HBM gene, which can be 
used to determine if people are predisposed to HBM and, therefore, not susceptible to 
diseases characterized by reduced bone density, including, for example, osteoporosis, or are 
predisposed and susceptible to diseases characterized by abnormally high bone density, such 
as, for example, osteopetrosis. Older individuals carrying the HBM gene express the HBM 
protein, and, therefore, do not develop osteoporosis. In other words, the HBM gene is a 
suppressor of osteoporosis. This in vivo observation is strong evidence that treatment of 
normal individuals with the HBM gene or protein, or fragments thereof, will ameliorate 
osteoporosis. 

Moreover, such treatment will be indicated in the treatment of bone lesions, 
particularly bone fractures, for bone remodeling in the healing of such lesions. For example, 
persons predisposed to or suffering from stress fractures (i.e., the accumulation of stress-induced 
microfractures, eventually resulting in a true fracture through the bone cortex) may 
be identified and/or treated by means of the invention. Moreover, the methods and 
compositions of the invention will be of use in the treatment of secondary osteoporosis, 
where the course of therapy involves bone remodeling, such as endocrine conditions 
accompanying corticosteroid administration, hyperthyroidism, hypogonadism, hematologic 
malignancies, malabsorption and alcoholism, as well as disorders associated with vitamin D 
and/or phosphate metabolism, such as osteomalacia and rickets, and diseases characterized by 
abnormal or disordered bone remodeling, such as Paget's disease, and in neoplasms of bone, 
which may be benign or malignant. 

In various embodiments, the present invention is directed to nucleic acids, proteins, 
vectors, and transformed hosts of HBM and Zmax1. 

Additionally, the present invention is directed to applications of the above 
embodiments of the invention including, for example, gene therapy, pharmaceutical 
development, and diagnostic assays for bone development disorders. In preferred 
embodiments, the present invention is directed to methods for treating, diagnosing, 
preventing and screening for osteoporosis. 

These and other aspects of the present invention are described in more detail below. 

BRIEF DESCRIPTION OF THE FIGURES 

Fig. 1 shows the pedigree of the individuals used in the genetic linkage studies. 
Under each individual is an ID number, the z-score for spinal BMD, and the allele calls for 
the critical markers on chromosome 11. Solid symbols represent "affected" individuals. 
Symbols containing "N" are "unaffected" individuals. DNA from 37 individuals was 
genotyped. Question marks denote unknown genotypes or individuals who were not 
genotyped. 

Fig. 2 depicts the BAC/STS content physical map of the HBM region in 11q13.3. 
STS markers derived from genes, ESTs, microsatellites, random sequences, and BAC 
endsequences are denoted above the long horizontal line. For markers that are present in 
GDB the same nomenclature has been used. Locus names (D11S####) are listed in 
parentheses after the primary name if available. STSs derived from BAC endsequences are 
listed with the BAC name first followed by L or R for the left and right end of the clone, 
respectively. The two large arrows indicate the genetic markers that define the HBM critical 
region. The horizontal lines below the STSs indicate BAC clones identified by PCR-based 
screening of a nine-fold coverage BAC library. Open circles indicate that the marker did not 
amplify the corresponding BAC library address during library screening. Clone names use 
the following convention: B for BAC, the plate, row and column address, followed by -H 
indicating the HBM project (e.g., B36F16-H). 

Figs. 3A-3F show the genomic structure of Zmax1 with flanking intron sequences. 
Translation is initiated by the underlined "ATG" in exon 1. The site of the polymorphism in 
the HBM gene is in exon 3 and is represented by the underlined "G," whereby this nucleotide 
is a "T" in the HBM gene. The 3' untranslated region of the mRNA is underlined within exon 
23 (exon 1, SEQ ID NO:40; exon 2, SEQ ID NO:41; exon 3, SEQ ID NO:42; exon 4, SEQ ID 
NO:43; exon 5, SEQ ID NO:44; exon 6, SEQ ID NO:45; exon 7, SEQ ID NO:46; exon 8, 
SEQ ID NO:47; exon 9, SEQ ID NO:48; exon 10, SEQ ID NO:49; exon 11, SEQ ID NO:50; 
exon 12, SEQ ID NO:51; exon 13, SEQ ID NO:52; exon 14, SEQ ID NO:53; exon 15, SEQ 
ID NO:54; exon 16, SEQ ID NO:55; exon 17, SEQ ID NO:56; exon 18, SEQ ID NO:57; 
exon 19, SEQ ID NO:58; exon 20, SEQ ID NO:59; exon 21, SEQ ID NO:60; exon 22, SEQ 
ID NO:61; and exon 23, SEQ ID NO:62). 

Fig. 4 shows the domain organization of Zmax1, including the YWTD spacers, the 
extracellular attachment site, the binding site for LDL and calcium, the cysteine-rich growth 
factor repeats, the transmembrane region, the ideal PEST region with the CK-II 
phosphorylation site and the internalization domain. Fig. 4 also shows the site of the glycine 
to valine change that occurs in the HBM protein. The signal peptide is located at amino acids 
1-22, the extracellular domain is located at amino acids 23-1385, the transmembrane segment 
is located at amino acids 1386-1413, and the cytoplasmic domain is located at amino acids 
1414-1615. 
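
For illustration, the amino acid ranges listed above can be encoded as a simple lookup. The sketch below is a hypothetical helper, not part of the disclosed methods; it uses only the boundaries given for Fig. 4 and shows that position 171, the site of the glycine to valine change described for Figs. 6A-6E, falls within the extracellular domain.

```python
from typing import List, Tuple

# Inclusive, 1-based amino acid ranges as listed in the description of Fig. 4.
ZMAX1_DOMAINS: List[Tuple[str, int, int]] = [
    ("signal peptide", 1, 22),
    ("extracellular domain", 23, 1385),
    ("transmembrane segment", 1386, 1413),
    ("cytoplasmic domain", 1414, 1615),
]


def domain_of(residue: int) -> str:
    """Return the name of the listed domain that contains the given residue."""
    for name, start, end in ZMAX1_DOMAINS:
        if start <= residue <= end:
            return name
    raise ValueError(f"residue {residue} is outside the range 1-1615")


if __name__ == "__main__":
    print(domain_of(171))  # site of the glycine to valine change
```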

Fig. 5 is a schematic illustration of the BAC contigs B527D12 and B200E21 in 
relation to the HBM gene. 

Figs. 6A-6E are the nucleotide and amino acid sequences of the wild-type gene, 
Zmax1. The location for the base pair substitution at nucleotide 582, a guanine to thymine, is 
underlined. This allelic variant is the HBM gene. The HBM gene encodes for a protein with 
an amino acid substitution of glycine to valine at position 171. The 5' untranslated region 
(UTR) spans bases 1 to 70, and the 3' UTR spans bases 4916-5120. 
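
The relationship between nucleotide 582 and amino acid position 171 can be checked by simple arithmetic, sketched below under the assumption that the coding sequence begins immediately after the 70-base 5' UTR (i.e., at base 71). The helper name is hypothetical, and the codon table is restricted to the standard glycine (GGN) and valine (GTN) codons.

```python
from typing import Tuple

GLYCINE_CODONS = {"GGT", "GGC", "GGA", "GGG"}
VALINE_CODONS = {"GTT", "GTC", "GTA", "GTG"}


def codon_location(nucleotide: int, utr5_length: int) -> Tuple[int, int]:
    """Map a 1-based cDNA position to (codon number, position within codon).

    Assumes the coding sequence starts at base utr5_length + 1.
    """
    cds_index = nucleotide - utr5_length  # 1-based index within the coding sequence
    if cds_index < 1:
        raise ValueError("position lies within the 5' UTR")
    codon_number = (cds_index - 1) // 3 + 1
    position_in_codon = (cds_index - 1) % 3 + 1
    return codon_number, position_in_codon


def substitute(codon: str, position_in_codon: int, new_base: str) -> str:
    """Return the codon with the base at the given 1-based position replaced."""
    index = position_in_codon - 1
    return codon[:index] + new_base + codon[index + 1:]


if __name__ == "__main__":
    codon_number, pos = codon_location(nucleotide=582, utr5_length=70)
    print(f"nucleotide 582 is base {pos} of codon {codon_number}")
    # A G-to-T change at the second base turns any glycine codon into a valine codon.
    for codon in sorted(GLYCINE_CODONS):
        mutated = substitute(codon, pos, "T")
        print(codon, "->", mutated, "(valine)" if mutated in VALINE_CODONS else "")
```

Under the stated assumption, nucleotide 582 is the second base of codon 171, and a guanine to thymine change at the second base of a glycine codon yields a valine codon, consistent with the substitution described above.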

Figs. 7A and 7B are northern blot analyses showing the expression of Zmax1 in 
various tissues. 

Fig. 8 is a PCR product analysis. 

Fig. 9 is allele specific oligonucleotide detection of the Zmax1 exon 3 mutation. 

Fig. 10 is the cellular localization of mouse Zmax1 by in situ hybridization at 100X 
magnification using sense and antisense probes. 

Fig. 11 is the cellular localization of mouse Zmax1 by in situ hybridization at 400X 
magnification using sense and antisense probes. 

Fig. 12 is the cellular localization of mouse Zmax1 by in situ hybridization of 
osteoblasts in the endosteum at 400X magnification using sense and antisense probes. 

Fig. 13 shows antisense inhibition of Zmax1 expression in MC-3T3 cells. 

DETAILED DESCRIPTION OF THE INVENTION 

To aid in the understanding of the specification and claims, the following definitions 
are provided. 

"Gene" refers to a DNA sequence that encodes through its template or messenger 
RNA a sequence of amino acids characteristic of a specific peptide. The term "gene" includes 
intervening, non-coding regions, as well as regulatory regions, and can include 5' and 3' ends. 

"Gene sequence" refers to a DNA molecule, including both a DNA molecule which 
contains a non-transcribed or non-translated sequence. The term is also intended to include 
any combination of gene(s), gene fragment(s), non-transcribed sequence(s) or non-translated 
sequence(s) which are present on the same DNA molecule. 

The sequences of the present invention may be derived from a variety of sources 
including DNA, cDNA, synthetic DNA, synthetic RNA or combinations thereof. Such 
sequences may comprise genomic DNA which may or may not include naturally occurring 
introns. Moreover, such genomic DNA may be obtained in association with promoter regions 
or poly (A) sequences. The sequences, genomic DNA or cDNA may be obtained in any of 
several ways. Genomic DNA can be extracted and purified from suitable cells by means well 
known in the art. Alternatively, mRNA can be isolated from a cell and used to produce 
cDNA by reverse transcription or other means. 

"cDNA" refers to complementary or copy DNA produced from an RNA template by 
the action of RNA-dependent DNA polymerase (reverse transcriptase). Thus, a "cDNA 
clone" means a duplex DNA sequence complementary to an RNA molecule of interest, 
carried in a cloning vector or PCR amplified. This term includes genes from which the 
intervening sequences have been removed. 
"Recombinant DNA" means a molecule that has been recombined by in vitro splicing of 
cDNA or a genomic DNA sequence. 

"Cloning" refers to the use of in vitro recombination techniques to insert a particular 
gene or other DNA sequence into a vector molecule. In order to successfully clone a desired 
gene, it is necessary to use methods for generating DNA fragments, for joining the fragments 
to vector molecules, for introducing the composite DNA molecule into a host cell in which it 
can replicate, and for selecting the clone having the target gene from amongst the recipient 
host cells. 

"cDNA library" refers to a collection of recombinant DNA molecules containing 
cDNA inserts which together comprise the entire genome of an organism. Such a cDNA 
library can be prepared by methods known to one skilled in the art and described by, for 
example, Cowell and Austin, "cDNA Library Protocols," Methods in Molecular Biology 
(1997). Generally, RNA is first isolated from the cells of an organism from whose genome it 
is desired to clone a particular gene. 

"Cloning vehicle" refers to a plasmid or phage DNA or other DNA sequence which is 
able to replicate in a host cell. The cloning vehicle is characterized by one or more 
endonuclease recognition sites at which such DNA sequences may be cut in a determinable 
fashion without loss of an essential biological function of the DNA, which may contain a 
marker suitable for use in the identification of transformed cells. 

"Expression control sequence" refers to a sequence of nucleotides that control or 
regulate expression of structural genes when operably linked to those genes. These include, 
for example, the lac systems, the trp system, major operator and promoter regions of the 
phage lambda, the control region of fd coat protein and other sequences known to control the 
expression of genes in prokaryotic or eukaryotic cells. Expression control sequences will 
vary depending on whether the vector is designed to express the operably linked gene in a 
prokaryotic or eukaryotic host, and may contain transcriptional elements such as enhancer 
elements, termination sequences, tissue-specificity elements and/or translational initiation and
termination sites. 

"Expression vehicle" refers to a vehicle or vector similar to a cloning vehicle but 
which is capable of expressing a gene which has been cloned into it, after transformation into 
a host. The cloned gene is usually placed under the control of (i.e., operably linked to) an
expression control sequence. 

"Operator" refers to a DNA sequence capable of interacting with the specific 
repressor, thereby controlling the transcription of adjacent gene(s). 

"Promoter" refers to a DNA sequence that can be recognized by an RNA polymerase. 
The presence of such a sequence permits the RNA polymerase to bind and initiate
transcription of operably linked gene sequences. 

"Promoter region" is intended to include the promoter as well as other gene sequences 
which may be necessary for the initiation of transcription. The presence of a promoter region 
is sufficient to cause the expression of an operably linked gene sequence. 

"Operably linked" means that the promoter controls the initiation of expression of the
gene. A promoter is operably linked to a sequence of proximal DNA if upon introduction
into a host cell the promoter determines the transcription of the proximal DNA sequence(s) 
into one or more species of RNA. A promoter is operably linked to a DNA sequence if the 
promoter is capable of initiating transcription of that DNA sequence. 

"Prokaryote" refers to all organisms without a true nucleus, including bacteria.

"Eukaryote" refers to organisms and cells that have a true nucleus, including 
mammalian cells. 

"Host" includes prokaryotes and eukaryotes, such as yeast and filamentous fungi, as 
well as plant and animal cells. The term includes an organism or cell that is the recipient of a 
replicable expression vehicle.

"Fragment" of a gene refers to any variant of the gene that possesses the biological 
activity of that gene.

"Variant" refers to a gene that is substantially similar in structure and biological 
activity or immunological characteristics to either the entire gene or to a fragment of the gene. 
Provided that the two genes possess a similar activity, they are considered variant as that 
term is used herein even if the sequence of amino acid residues is not identical.

"Amplification of nucleic acids" refers to methods such as polymerase chain reaction 
(PCR), ligation amplification (or ligase chain reaction, LCR) and amplification methods
based on the use of Q-beta replicase. These methods are well known in the art and described,
for example, in U.S. Patent Nos. 4,683,195 and 4,683,202. Reagents and hardware for
conducting PCR are commercially available. Primers useful for amplifying sequences from
the HBM region are preferably complementary to, and hybridize specifically to, sequences in
the HBM region or in regions that flank a target region therein. HBM sequences generated by
amplification may be sequenced directly. Alternatively, the amplified sequence(s) may be
cloned prior to sequence analysis.

"Antibodies" may refer to polyclonal and/or monoclonal antibodies and fragments
thereof, and immunologic binding equivalents thereof, that can bind to the HBM proteins and
fragments thereof or to nucleic acid sequences from the HBM region, particularly from the
HBM locus or a portion thereof. The term antibody is used to refer both to a homogeneous
molecular entity and to a mixture such as a serum product made up of a plurality of different
molecular entities. Proteins may be prepared synthetically in a protein synthesizer and
coupled to a carrier molecule and injected over several months into rabbits. Rabbit sera are
tested for immunoreactivity to the HBM protein or fragment. Monoclonal antibodies may be
made by injecting mice with the proteins, or fragments thereof. Monoclonal antibodies will
be screened by ELISA and tested for specific immunoreactivity with HBM protein or
fragments thereof. Harlow et al., Antibodies: A Laboratory Manual, Cold Spring Harbor
Laboratory, Cold Spring Harbor, NY (1988). These antibodies will be useful in assays as
well as pharmaceuticals.

"HBM" refers to high bone mass. 

"HBM protein" refers to a protein that is identical to a Zmaxl protein except that it 
contains an alteration of glycine 171 to valine. An HBM protein is defined for any organism 
that encodes a Zmaxl true homologue. For example, a mouse HBM protein refers to the
mouse Zmaxl protein having the glycine 170 to valine substitution. 

"HBM gene" refers to the genomic DNA sequence found in individuals showing the 
HBM characteristic or phenotype, where the sequence encodes the protein indicated by SEQ 
ID NO: 4. The HBM gene and the Zmaxl gene are allelic. The protein encoded by the HBM 
gene has the property of causing elevated bone mass, while the protein encoded by the Zmaxl
gene does not. The HBM gene and the Zmaxl gene differ in that the HBM gene has a 
thymine at position 582, while the Zmaxl gene has a guanine at position 582. The HBM 
gene comprises the nucleic acid sequence shown as SEQ ID NO: 2. The HBM gene may also 
be referred to as an "HBM polymorphism." 

"Normal," "wild-type," "unaffected" and "Zmaxl" all refer to the genomic DNA
sequence that encodes the protein indicated by SEQ ID NO: 3. The Zmaxl gene has a
guanine at position 582. The Zmaxl gene comprises the nucleic acid sequence shown as
SEQ ID NO: 1. "Normal," "wild-type," "unaffected" and "Zmaxl" also refer to allelic
variants of the genomic sequence that encodes proteins that do not contribute to elevated bone
mass. The Zmaxl gene is common in the human population, while the HBM gene is rare.

"5YWT+EGF" refers to a repeat unit found in the Zmaxl protein, consisting of five 
YWT repeats followed by an EGF repeat. 

"Bone development" generally refers to any process involved in the change of bone 
over time, including, for example, normal development, changes that occur during disease 
states, and changes that occur during aging. "Bone development disorder" particularly refers
to any disorders in bone development including, for example, changes that occur during
disease states and changes that occur during aging. Bone development may be progressive or
cyclical in nature. Aspects of bone that may change during development include, for 
example, mineralization, formation of specific anatomical features, and relative or absolute 
numbers of various cell types. 

"Bone modulation" or "modulation of bone formation" refers to the ability to affect 
any of the physiological processes involved in bone remodeling, as will be appreciated by one 
skilled in the art, including, for example, bone resorption and appositional bone growth, by, 
inter alia, osteoclastic and osteoblastic activity, and may comprise some or all of bone 
formation and development as used herein. 

"Normal bone density" refers to a bone density within two standard deviations of a Z 
score of 0.
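
As an illustration of the definition above, the following minimal sketch computes a Z score from a measured bone mineral density and tests whether it falls within two standard deviations of 0. The function names and the reference mean and standard deviation are hypothetical and are not taken from this specification; a real reference population would be age- and sex-matched.

```python
def z_score(bmd, reference_mean, reference_sd):
    """Number of standard deviations by which `bmd` differs from a reference mean."""
    if reference_sd <= 0:
        raise ValueError("reference_sd must be positive")
    return (bmd - reference_mean) / reference_sd


def is_normal_bone_density(z, limit=2.0):
    """True if the Z score lies within `limit` standard deviations of 0."""
    return abs(z) <= limit


# Hypothetical reference values, for illustration only.
z = z_score(bmd=1.25, reference_mean=1.0, reference_sd=0.125)
print(z, is_normal_bone_density(z))
```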

A "Zmaxl system" refers to a purified protein, cell extract, cell, animal, human or any 
other composition of matter in which Zmaxl is present in a normal or mutant form. 

A "surrogate marker" refers to a diagnostic indication, symptom, sign or other feature 
that can be observed in a cell, tissue, human or animal that is correlated with the HBM gene 
or elevated bone mass or both, but that is easier to measure than bone density. The general 
concept of a surrogate marker is well accepted in diagnostic medicine. 

The present invention encompasses the Zmaxl gene and Zmaxl protein in the forms 
indicated by SEQ ID NOS: 1 and 3, respectively, and other closely related variants, as well as 
the adjacent chromosomal regions of Zmaxl necessary for its accurate expression. In a 
preferred embodiment, the present invention is directed to at least 15 contiguous nucleotides 
of the nucleic acid sequence of SEQ ID NO: 1.

The present invention also encompasses the HBM gene and HBM protein in the forms 
indicated by SEQ ID NOS: 2 and 4, respectively, and other closely related variants, as well as
the adjacent chromosomal regions of the HBM gene necessary for its accurate expression. In
a preferred embodiment, the present invention is directed to at least 15 contiguous nucleotides
of the nucleic acid sequence of SEQ ID NO: 2. More preferably, the present invention is
directed to at least 15 contiguous nucleotides of the nucleic acid sequence of SEQ ID NO: 2, 
wherein one of the 15 contiguous nucleotides is the thymine at nucleotide 582. 
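
The set of 15-nucleotide stretches that include a given position can be enumerated mechanically. The sketch below is illustrative only: it assumes 1-based position numbering, uses a toy sequence rather than SEQ ID NO: 2, and the function name is hypothetical.

```python
def windows_containing(seq, position, length=15):
    """Return (start, subsequence) for every window of `length` contiguous
    nucleotides in `seq` that includes the 1-based `position`.
    `start` is also 1-based."""
    if not 1 <= position <= len(seq):
        raise ValueError("position is outside the sequence")
    idx = position - 1                      # 0-based index of the position
    first = max(0, idx - length + 1)        # leftmost window start
    last = min(idx, len(seq) - length)      # rightmost window start
    return [(s + 1, seq[s:s + length]) for s in range(first, last + 1)]


# Toy sequence with a single T at position 21 (not the real gene sequence).
toy = "A" * 20 + "T" + "C" * 20
for start, window in windows_containing(toy, 21):
    print(start, window)
```

Away from the ends of the sequence, a position is covered by exactly 15 such windows, one for each offset of the position within the window.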

The invention also relates to the nucleotide sequence of the Zmaxl gene region, as
well as the nucleotide sequence of the HBM gene region. More particularly, a preferred
embodiment is the BAC clones containing segments of the Zmaxl gene region, B200E21-H
and B527D12-H. A preferred embodiment is the nucleotide sequence of the BAC clones
consisting of SEQ ID NOS: 5-12.

The invention also concerns the use of the nucleotide sequence to identify DNA 
probes for the Zmaxl gene and the HBM gene, PCR primers to amplify the Zmaxl gene and
the HBM gene, nucleotide polymorphisms in the Zmaxl gene and the HBM gene, and 
regulatory elements of the Zmaxl gene and the HBM gene. 

This invention describes the further localization of the chromosomal location of the
Zmaxl gene and HBM gene on chromosome 11q13.3 between genetic markers D11S987 and
SNP CONTIG033-6, as well as the DNA sequences of the Zmaxl gene and the HBM gene.
The chromosomal location was refined by the addition of more genetic markers to the
mapping panel used to map the gene, and by the extension of the pedigree to include more
individuals. The pedigree extension was critical because the new individuals that have been
genotyped harbor critical recombination events that narrow the region. To identify genes in
the region on 11q13.3, a set of BAC clones containing this chromosomal region was
identified. The BAC clones served as a template for genomic DNA sequencing, and also as a
reagent for identifying coding sequences by direct cDNA selection. Genomic sequencing and
direct cDNA selection were used to characterize more than 1.5 million base pairs of DNA
from 11q13.3. The Zmaxl gene was identified within this region and the HBM gene was
then discovered after mutational analysis of affected and unaffected individuals.

When a gene has been genetically localized to a specific chromosomal region, the 
genes in this region can be characterized at the molecular level by a series of steps that
include: cloning of the entire region of DNA in a set of overlapping clones (physical 
mapping), characterization of genes encoded by these clones by a combination of direct 
cDNA selection, exon trapping and DNA sequencing (gene identification), and identification 
of mutations in these genes by comparative DNA sequencing of affected and unaffected 
members of the HBM kindred (mutation analysis). 

Physical mapping is accomplished by screening libraries of human DNA cloned in 
vectors that are propagated in E. coli or S. cerevisiae using PCR assays designed to amplify
unique molecular landmarks in the chromosomal region of interest. To generate a physical
map of the HBM candidate region, a library of human DNA cloned in Bacterial Artificial
Chromosomes (BACs) was screened with a set of Sequence Tagged Site (STS) markers that
had been previously mapped to chromosome 11q12-q13 by the efforts of the Human Genome
Project. 

STSs are unique molecular landmarks in the human genome that can be assayed by 
PCR. Through the combined efforts of the Human Genome Project, the location of thousands 
of STSs on the twenty-two autosomes and two sex chromosomes has been determined. For a 
positional cloning effort, the physical map is tied to the genetic map because the markers used 
for genetic mapping can also be used as STSs for physical mapping. By screening a BAC 
library with a combination of STSs derived from genetic markers, genes, and random DNA 
fragments, a physical map comprised of overlapping clones representing all of the DNA in a 
chromosomal region of interest can be assembled. 

BACs are cloning vectors for large (80 kilobase to 200 kilobase) segments of human 
or other DNA that are propagated in E. coli. To construct a physical map using BACs, a 
library of BAC clones is screened so that individual clones harboring the DNA sequence 
corresponding to a given STS or set of STSs are identified. Throughout most of the human 
genome, the STS markers are spaced approximately 20 to 50 kilobases apart, so that an 
individual BAC clone typically contains at least two STS markers. In addition, the BAC
libraries that were screened contain enough cloned DNA to cover the human genome six
times over. Therefore, an individual STS typically identifies more than one BAC clone. By
screening a six-fold coverage BAC library with a series of STS markers spaced
approximately 50 kilobases apart, a physical map consisting of a series of overlapping BAC
clones, i.e. BAC contigs, can be assembled for any region of the human genome. This map is 
closely tied to the genetic map because many of the STS markers used to prepare the physical 
map are also genetic markers. 
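
The arithmetic behind these statements can be sketched as follows. The sketch assumes that clones are distributed at random along the genome, so that the number of clones containing a given STS is approximately Poisson-distributed with a mean equal to the library coverage. That model, and the function names, are illustrative assumptions and are not part of the described method.

```python
import math


def expected_sts_per_clone(insert_kb, sts_spacing_kb):
    """Average number of STS markers falling within one clone insert."""
    return insert_kb / sts_spacing_kb


def prob_at_least_k_clones(k, coverage):
    """Probability that at least k clones contain a given STS, assuming the
    clone count per STS is Poisson-distributed with mean `coverage`."""
    p_fewer = sum(
        math.exp(-coverage) * coverage ** i / math.factorial(i) for i in range(k)
    )
    return 1.0 - p_fewer


# A 100 kb insert with STSs every 50 kb carries about two STSs on average.
print(expected_sts_per_clone(100, 50))
# With six-fold coverage, an STS is very likely to identify two or more clones.
print(prob_at_least_k_clones(2, 6))
```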

When constructing a physical map, it often happens that there are gaps in the STS
map of the genome that result in the inability to identify BAC clones that are overlapping in a
given location. Typically, the physical map is first constructed from a set of STSs that have
been identified through the publicly available literature and World Wide Web resources. The
initial map consists of several separate BAC contigs that are separated by gaps of unknown
molecular distance. To identify BAC clones that fill these gaps, it is necessary to develop
new STS markers from the ends of the clones on either side of the gap. This is done by
sequencing the terminal 200 to 300 base pairs of the BACs flanking the gap, and developing a
PCR assay to amplify a sequence of 100 or more base pairs. If the terminal sequences are
demonstrated to be unique within the human genome, then the new STS can be used to screen
the BAC library to identify additional BACs that contain the DNA from the gap in the
physical map. To assemble a BAC contig that covers a region the size of the HBM candidate
region (2,000,000 or more base pairs), it is often necessary to develop new STS markers from
the ends of several clones.

After building a BAC contig, this set of overlapping clones serves as a template for
identifying the genes encoded in the chromosomal region. Gene identification can be
accomplished by many methods. Three methods are commonly used: (1) a set of BACs
selected from the BAC contig to represent the entire chromosomal region can be sequenced,
and computational methods can be used to identify all of the genes, (2) the BACs from the
BAC contig can be used as a reagent to clone cDNAs corresponding to the genes encoded in
the region by a method termed direct cDNA selection, or (3) the BACs from the BAC contig
can be used to identify coding sequences by selecting for specific DNA sequence motifs in a
procedure called exon trapping. The present invention includes genes identified by the first
two methods.

To sequence the entire BAC contig representing the HBM candidate region, a set of
BACs was chosen for subcloning into plasmid vectors and subsequent DNA sequencing of
these subclones. Since the DNA cloned in the BACs represents genomic DNA, this
sequencing is referred to as genomic sequencing to distinguish it from cDNA sequencing. To
initiate the genomic sequencing for a chromosomal region of interest, several
non-overlapping BAC clones are chosen. DNA for each BAC clone is prepared, and the clones
are sheared into random small fragments which are subsequently cloned into standard
plasmid vectors such as pUC18. The plasmid clones are then grown to propagate the smaller
fragments, and these are the templates for sequencing. To ensure adequate coverage and
sequence quality for the BAC DNA sequence, sufficient plasmid clones are sequenced to
yield six-fold coverage of the BAC clone. For example, if the BAC is 100 kilobases long,
then phagemids are sequenced to yield 600 kilobases of sequence. Since the BAC DNA was
randomly sheared prior to cloning in the phagemid vector, the 600 kilobases of raw DNA
sequence can be assembled by computational methods into overlapping DNA sequences
termed sequence contigs. For the purposes of initial gene identification by computational
methods, six-fold coverage of each BAC is sufficient to yield ten to twenty sequence contigs
of 1000 base pairs to 20,000 base pairs.
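
The coverage arithmetic in the example above (a 100 kilobase BAC sequenced to 600 kilobases of raw sequence) can be written out as a short sketch. The average read length of 500 bases used in the usage example is a hypothetical value chosen for illustration; the specification does not state a read length.

```python
import math


def raw_bases_needed(bac_length_bp, coverage=6):
    """Total raw sequence required to reach the given fold coverage."""
    return bac_length_bp * coverage


def reads_needed(bac_length_bp, read_length_bp, coverage=6):
    """Number of sequencing reads of a given average length needed to reach
    the requested coverage, rounded up to a whole read."""
    return math.ceil(raw_bases_needed(bac_length_bp, coverage) / read_length_bp)


# A 100 kb BAC at six-fold coverage requires 600 kb of raw sequence.
print(raw_bases_needed(100_000))
# Read length of 500 bases is an assumed figure, not from the specification.
print(reads_needed(100_000, read_length_bp=500))
```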

The sequencing strategy employed in this invention was to initially sequence "seed"
BACs from the BAC contig in the HBM candidate region. The sequence of the "seed" BACs
was then used to identify minimally overlapping BACs from the contig, and these were
subsequently sequenced. In this manner, the entire candidate region was sequenced, with
several small sequence gaps left in each BAC. This sequence served as the template for
computational gene identification. One method for computational gene identification is to
compare the sequence of the BAC contig to publicly available databases of cDNA and genomic
sequences, e.g. UniGene, dbEST, GenBank. These comparisons are typically done using the
BLAST family of computer algorithms and programs (Altschul et al., J. Mol. Biol.,
215:403-410 (1990)). The BAC sequence can also be translated into protein sequence, and the
protein sequence can be used to search publicly available protein databases, using a version of
BLAST designed to analyze protein sequences (Altschul et al., Nucl. Acids Res.,
25:3389-3402 (1997)). Another method is to use computer algorithms such as MZEF (Zhang, Proc.
Natl. Acad. Sci., 94:565-568 (1997)) and GRAIL (Uberbacher et al., Methods Enzymol.,
266:259-281 (1996)), which predict the location of exons in the sequence based on the
presence of specific DNA sequence motifs that are common to all exons, as well as the
presence of codon usage typical of human protein encoding sequences.

In addition to identifying genes by computational methods, genes were also identified
by direct cDNA selection (Del Mastro et al., Genome Res., 5(2):185-194 (1995)). In direct
cDNA selection, cDNA pools from tissues of interest are prepared, and the BACs from the
candidate region are used in a liquid hybridization assay to capture the cDNAs which base
pair to coding regions in the BAC. In the methods described herein, the cDNA pools were
created from several different tissues by random priming the first strand cDNA from polyA
RNA, synthesizing the second strand cDNA by standard methods, and adding linkers to the
ends of the cDNA fragments. The linkers are used to amplify the cDNA pools. The BAC
clones are used as a template for in vitro DNA synthesis to create a biotin labelled copy of the
BAC DNA. The biotin labelled copy of the BAC DNA is then denatured and incubated with
an excess of the PCR amplified, linkered cDNA pools which have also been denatured. The
BAC DNA and cDNA are allowed to anneal in solution, and heteroduplexes between the
BAC and the cDNA are isolated using streptavidin coated magnetic beads. The cDNAs that
are captured by the BAC are then amplified using primers complementary to the linker
sequences, and the hybridization/selection process is repeated for a second round. After two
rounds of direct cDNA selection, the cDNA fragments are cloned, and a library of these direct
selected fragments is created.

The cDNA clones isolated by direct selection are analyzed by two methods. Since a
pool of BACs from the HBM candidate region is used to provide the genomic DNA
sequence, the cDNAs must be mapped to individual BACs. This is accomplished by arraying
the BACs in microtiter dishes, and replicating their DNA in high density grids. Individual
cDNA clones are then hybridized to the grid to confirm that they have sequence identity to an
individual BAC from the set used for direct selection, and to determine the specific identity of
that BAC. cDNA clones that are confirmed to correspond to individual BACs are sequenced.
To determine whether the cDNA clones isolated by direct selection share sequence identity
or similarity to previously identified genes, the DNA and protein coding sequences are
compared to publicly available databases using the BLAST family of programs.

The combination of genomic DNA sequence and cDNA sequence provided by BAC
sequencing and by direct cDNA selection yields an initial list of putative genes in the region.
The genes in the region were all candidates for the HBM locus. To further characterize each
gene, Northern blots were performed to determine the size of the transcript corresponding to
each gene, and to determine which putative exons were transcribed together to make an
individual gene. For Northern blot analysis of each gene, probes were prepared from direct
selected cDNA clones or by PCR amplifying specific fragments from genomic DNA or from
the BAC encoding the putative gene of interest. The Northern blots gave information on the
size of the transcript and the tissues in which it was expressed. For transcripts which were
not highly expressed, it was sometimes necessary to perform a reverse transcription PCR
assay using RNA from the tissues of interest as a template for the reaction.

Gene identification by computational methods and by direct cDNA selection provides
unique information about the genes in a region of a chromosome. When genes are identified,
then it is possible to examine different individuals for mutations in each gene.

I. Phenotyping using DXA Measurements 

Spinal bone mineral content (BMC) and bone mineral density (BMD) measurements
performed at Creighton University (Omaha, Nebraska) were made by DXA using a Norland
Instruments densitometer (Norland XR2600 Densitometer, Dual Energy X-ray
Absorptiometry, DXA). Spinal BMC and BMD at other locations used the machinery
available. There are estimated to be 800 DXA machines currently operating in the U.S. Most
larger cities have offices or imaging centers which have DXA capabilities, usually a Lunar or
Hologic machine. Each location that provided spine BMC and BMD data included copies of
the printouts from their machines to provide verification that the regions of interest for
measurement of BMD have been chosen appropriately. Complete clinical histories and
skeletal radiographs were obtained.

The HBM phenotype is defined by the following criteria: very high spinal BMD; a
clinical history devoid of any known high bone mass syndrome; and skeletal radiographs
showing a normal shape of the appendicular skeleton.

II. Genotyping of Microsatellite Markers 

To narrow the genetic interval to a region smaller than that originally reported by
Johnson et al., Am. J. Hum. Genet., 60:1326-1332 (1997), additional microsatellite markers on
chromosome 11q12-13 were typed. The new markers included: D11S4191, D11S1883,
D11S1785, D11S4113, D11S4136, D11S4139 (Dib et al., Nature, 380:152-154 (1996)),
FGF3 (Polymeropolous et al., Nucl. Acid Res., 18:7468 (1990)), as well as
GTC_HBM_Marker_1, GTC_HBM_Marker_2, GTC_HBM_Marker_3,
GTC_HBM_Marker_4, GTC_HBM_Marker_5, GTC_HBM_Marker_6, and
GTC_HBM_Marker_7 (see Fig. 2).

Blood (20 ml) was drawn into lavender cap (EDTA containing) tubes by a certified
phlebotomist. The blood was stored refrigerated until DNA extraction. DNA has been
extracted from blood stored for up to 7 days in the refrigerator without reduction in the
quality or quantity of yield. For subjects whose blood was drawn at distant sites, a
shipping protocol was successfully used on more than a dozen occasions. Blood samples
were shipped by overnight express in a styrofoam container with freezer packs to provide
cooling. Lavender cap tubes were placed in individual plastic shipping tubes and then into
"zip-lock" biohazard bags. When the samples arrived the next day, they were immediately
processed to extract DNA.

The DNA extraction procedure used a kit purchased from Gentra Systems, Inc.
(Minneapolis, Minnesota). Briefly, the procedure involved adding 3 volumes of a red blood
cell lysis buffer to the whole blood. After incubation for 10 minutes at room temperature,
the solution was centrifuged in a Beckman tabletop centrifuge at 2,000 x g for 10 minutes.
The white blood cell pellet was resuspended in Cell Lysis Buffer. Once the pellet was
completely resuspended and free of cell clumps, the solution was digested with RNase A for
15 minutes at 37°C. Proteins were precipitated by addition of the provided Protein
Precipitation Solution and removed by centrifugation. The DNA was precipitated out of the
supernatant by addition of isopropanol. This method was simple and fast, requiring only 1-2
hours, and allowed for the processing of dozens of samples simultaneously. The yield of
DNA was routinely >8 mg for a 20 ml sample of whole blood and had a MW of >50 kb.
DNA was archived by storing coded 50 µg aliquots at -80°C as an ethanol precipitate.

DNA was genotyped using one fluorescently labeled oligonucleotide primer and one
unlabeled oligonucleotide primer. Labeled and unlabeled oligonucleotides were obtained
from Integrated DNA Technologies, Inc. (Coralville, Iowa). All other reagents for
microsatellite genotyping were purchased from Perkin Elmer-Applied Biosystems, Inc.
("PE-ABI") (Norwalk, Connecticut). Individual PCR reactions were performed for each marker, as
described by PE-ABI using AmpliTaq DNA Polymerase. The reactions were added to 3.5 µl
of loading buffer containing deionized formamide, blue dextran and TAMRA 350 size
standards (PE-ABI). After heating at 95°C for 5 minutes to denature the DNA, the samples
were loaded and electrophoresed as described in the operator's manual for the Model 377
DNA Sequencer (PE-ABI, Foster City, California). After gel electrophoresis, the data were
analyzed using PE-ABI GENESCAN™ and GENOTYPER™ software. First, within the
GENESCAN™ software, the lane tracking was manually optimized prior to the first step of
analysis. After the gel lane data were extracted, the standard curve profiles of each lane were
examined and verified for linearity and size calling. Lanes that had problems with either
of these parameters were re-tracked and verified. Once all lanes were tracked and the size
standards were correctly identified, the data were imported into GENOTYPER™ for allele
identification. To expedite allele calling (binning), the program Linkage Designer from the
Internet web-site of Dr. Guy Van Camp (http://alt.www.uia.ac.be/u/dnalab/ld.html) was used.
This program greatly facilitates the importing of data generated by GENOTYPER™ into the
pedigree drawing program Cyrillic (Version 2.0, Cherwell Scientific Publishing Limited,
Oxford, Great Britain) and subsequent linkage analysis using the program LINKAGE
(Lathrop et al., Am. J. Hum. Genet., 37:482-498 (1985)).

III. Linkage Analysis

Fig. 1 shows the pedigree of the individuals used in the genetic linkage studies
for this invention. Specifically, two-point linkage analysis was performed using the MLINK
and LINKMAP components of the program LINKAGE (Lathrop et al., Am. J. Hum. Genet.,
37:482-498 (1985)). Pedigree/marker data were exported from Cyrillic as a pre-file into the
Makeped program and converted into a suitable ped-file for linkage analysis.

The original linkage analysis was performed using three models: (i) an autosomal
dominant, fully penetrant model, (ii) an autosomal dominant model with reduced penetrance,
and (iii) a quantitative trait model. The HBM locus was mapped to chromosome 11q12-13
by analyzing DNA for linked markers from 22 members of a large, extended kindred. A
highly automated technology was used with a panel of 345 fluorescent markers which
spanned the 22 autosomes at a spacing interval ranging from 6-22 cM. Only markers from
this region of chromosome 11 showed evidence of linkage (LOD score -3.0). The highest
LOD score (5.74) obtained by two-point and multipoint analysis was at D11S987 (map position
55 in Fig. 2). The 95% confidence interval placed the HBM locus between markers D11S905
and D11S937 (map position 41-71 in Fig. 2). Haplotype analysis also places the Zmaxl gene
in this same region. Further descriptions of the markers D11S987, D11S905, and D11S937
can be found in Gyapay et al., Nature Genetics, Vol. 7 (1994).

In this invention, the inventors report the narrowing of the HBM interval to the region
between markers D11S987 and GTC_HBM_Marker_5. These two markers lie between the
delimiting markers from the original analysis (D11S905 and D11S937) and are
approximately 3 cM from one another. The narrowing of the interval was accomplished
using genotypic data from the markers D11S4191, D11S1883, D11S1785, D11S4113,
D11S4136, D11S4139 (Dib et al., Nature, 380:152-154 (1996)), FGF3 (Polymeropolous et
al., Nucl. Acid Res., 18:7468 (1990)) (information about the genetic markers can be found at
the internet site of the Genome Database, http://gdbwww.gdb.org/), as well as the markers
GTC_HBM_Marker_1, GTC_HBM_Marker_2, GTC_HBM_Marker_3,
GTC_HBM_Marker_4, GTC_HBM_Marker_5, GTC_HBM_Marker_6, and
GTC_HBM_Marker_7.

As shown in Fig. 1, haplotype analysis with the above genetic markers identifies
recombination events (crossovers) in individuals 9019 and 9020 that significantly refine the
interval of chromosome 11 to which the Zmaxl gene is localized. Individual 9019 is an
HBM-affected individual that inherits a portion of chromosome 11 from the maternal
chromosome with the HBM gene, and a portion from the chromosome 11 homologue. The
portion inherited from the HBM gene-carrying chromosome includes markers D11S935,
D11S1313, GTC_HBM_Marker_4, D11S987, D11S1296, GTC_HBM_Marker_6,
GTC_HBM_Marker_2, D11S970, GTC_HBM_Marker_3, D11S4113,
GTC_HBM_Marker_1, GTC_HBM_Marker_7 and GTC_HBM_Marker_5. The portion
from D11S4136 and continuing in the telomeric direction is derived from the non-HBM
chromosome. These data place the Zmaxl gene in a location centromeric to the marker
GTC_HBM_Marker_5. Individual 9020 is an unaffected individual who also exhibits a
critical recombination event. This individual inherits a recombinant paternal chromosome 11
that includes markers D11S935, D11S1313, GTC_HBM_Marker_4, D11S987, D11S1296
and GTC_HBM_Marker_6 from her father's (individual 0115) chromosome 11 homologue
that carries the HBM gene, and markers GTC_HBM_Marker_2, D11S970,
GTC_HBM_Marker_3, GTC_HBM_Marker_1, GTC_HBM_Marker_7,
GTC_HBM_Marker_5, D11S4136, D11S4139, D11S1314, and D11S937 from her father's
chromosome 11 that does not carry the HBM gene. Marker D11S4113 is uninformative due
to its homozygous nature in individual 0115. This recombination event places the
centromeric boundary of the HBM region between markers D11S1296 and D11S987.

Two-point linkage analysis was also used to confirm the location of the Zmax1 gene 
on chromosome 11. The linkage results for two-point linkage analysis under a model of full 
penetrance are presented in Table 1 below. This table lists the genetic markers in the first 
column and the recombination fractions across the top of the table. Each cell of the table 
shows the LOD score for an individual marker tested for linkage to the Zmax1 gene at the 
recombination fraction shown in the first row. For example, the peak LOD score of 7.66 
occurs at marker D11S970, which is within the interval defined by haplotype analysis. 



TABLE 1 

Marker     0.0        0.05   0.1    0.15   0.2    0.25   0.3    0.35   0.4 
D11S935    -infinity  0.39   0.49   0.47   0.41   0.33   0.25   0.17   0.10 
D11S1313   -infinity  2.64   2.86   2.80   2.59   2.30   1.93   1.49   1.00 
D11S987    -infinity  5.49   5.18   4.70   4.13   3.49   2.79   2.03   1.26 
D11S4113   4.35       3.99   3.62   3.24   2.83   2.40   1.94   1.46   0.97 
D11S1337   2.29       2.06   1.81   1.55   1.27   0.99   0.70   0.42   0.18 
D11S970    7.66       6.99   6.29   5.56   4.79   3.99   3.15   2.30   1.44 
D11S4136   6.34       5.79   5.22   4.61   3.98   3.30   2.59   1.85   1.11 
D11S4139   6.80       6.28   5.73   5.13   4.50   3.84   3.13   2.38   1.59 
FGF3       0.59       3.23   3.15   2.91   2.61   2.25   1.84   1.40   0.92 
D11S1314   6.96       6.49   5.94   5.34   4.69   4.01   3.27   2.49   1.67 
D11S937    -infinity  4.98   4.86   4.52   4.06   3.51   2.88   2.20   1.47 
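
As an illustration of how entries such as those in Table 1 arise, the following is a minimal Python sketch of a two-point LOD score under a deliberately simplified model: phase-known, fully informative meioses in which recombinants can be counted directly. This is an assumption made for illustration only; the pedigree-based likelihood calculation used for the analysis is not described here and is not reproduced. The function names are hypothetical. The sketch shows why a marker with at least one observed recombinant scores minus infinity at a recombination fraction of 0.0, and how the peak score can be read from a subset of the rows transcribed from Table 1.

```python
import math

def two_point_lod(recombinants: int, meioses: int, theta: float) -> float:
    """LOD score at recombination fraction theta for phase-known meioses.

    Simplified counting model (an assumption, not the pedigree likelihood):
    L(theta) = theta**k * (1 - theta)**(n - k), compared with L(0.5).
    """
    if not 0.0 <= theta <= 0.5:
        raise ValueError("theta must lie in [0, 0.5]")
    if not 0 <= recombinants <= meioses:
        raise ValueError("recombinants must lie in [0, meioses]")
    k, n = recombinants, meioses
    if theta == 0.0:
        # A single observed recombinant makes theta = 0 impossible,
        # so the likelihood is zero and the LOD score is -infinity.
        return float("-inf") if k > 0 else n * math.log10(2.0)
    log_l_linked = k * math.log10(theta) + (n - k) * math.log10(1.0 - theta)
    log_l_unlinked = n * math.log10(0.5)
    return log_l_linked - log_l_unlinked

# Recombination fractions and three rows transcribed from Table 1.
THETAS = (0.0, 0.05, 0.1, 0.15, 0.2, 0.25, 0.3, 0.35, 0.4)
TABLE_1_ROWS = {
    "D11S987": (float("-inf"), 5.49, 5.18, 4.70, 4.13, 3.49, 2.79, 2.03, 1.26),
    "D11S970": (7.66, 6.99, 6.29, 5.56, 4.79, 3.99, 3.15, 2.30, 1.44),
    "D11S1314": (6.96, 6.49, 5.94, 5.34, 4.69, 4.01, 3.27, 2.49, 1.67),
}

def peak_lod(rows):
    """Return (marker, theta, lod) for the largest LOD score in rows."""
    cells = (
        (marker, theta, lod)
        for marker, lods in rows.items()
        for theta, lod in zip(THETAS, lods)
    )
    return max(cells, key=lambda cell: cell[2])

if __name__ == "__main__":
    print(peak_lod(TABLE_1_ROWS))
    print(two_point_lod(1, 10, 0.0))
```

Under this simplified model, a LOD score that is finite at a recombination fraction of 0.0 corresponds to a marker for which no recombinant with the gene was observed. 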



A single nucleotide polymorphism (SNP) further defines the HBM region. This SNP 
is termed SNP_Contig033-6 and is located 25 kb centromeric to the genetic marker 
GTC_HBM_Marker_5. This SNP is telomeric to the genetic marker GTC_HBM_Marker_7. 
SNP_Contig033-6 is present in HBM-affected individual 0113. However, the HBM-affected 
individual 9019, who is the son of 0113, does not carry this SNP. Therefore, this indicates 
that the crossover is centromeric to this SNP. The primer sequences for the genetic markers 
GTC_HBM_Marker_5 and GTC_HBM_Marker_7 are shown in Table 2 below. 



TABLE 2 

Marker             Primer (Forward)            Primer (Reverse) 
GTC_HBM_Marker_5   TTTTGGGTACACAATTCACi 1 CG   AAAACTGTGGGTGCTTCTGG 
GTC_HBM_Marker_7   GTGATTGAGCCAATCCTGAGA       TGAGCCAAATAAACCCCTTCT 
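
For readers who wish to sanity-check primer pairs such as those in Table 2, the following minimal Python sketch computes GC content and a rough melting temperature using the Wallace rule (2 degrees per A or T, 4 degrees per G or C). The Wallace rule is a swapped-in rule of thumb intended for short oligonucleotides; it is an assumption of this sketch and is not stated to be the method used to design these primers. Only the two GTC_HBM_Marker_7 primers are used, since they are fully legible; the function names are hypothetical.

```python
def base_counts(sequence: str) -> dict:
    """Count A, C, G and T in a primer, rejecting any other symbol."""
    seq = sequence.upper()
    unexpected = set(seq) - set("ACGT")
    if unexpected:
        raise ValueError(f"unexpected symbols: {sorted(unexpected)}")
    return {base: seq.count(base) for base in "ACGT"}

def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in the primer."""
    counts = base_counts(sequence)
    return (counts["G"] + counts["C"]) / len(sequence)

def wallace_tm(sequence: str) -> int:
    """Rule-of-thumb melting temperature in degrees C (Wallace rule)."""
    counts = base_counts(sequence)
    return 2 * (counts["A"] + counts["T"]) + 4 * (counts["G"] + counts["C"])

# GTC_HBM_Marker_7 primers as listed in Table 2.
MARKER_7_FORWARD = "GTGATTGAGCCAATCCTGAGA"
MARKER_7_REVERSE = "TGAGCCAAATAAACCCCTTCT"

if __name__ == "__main__":
    for name, primer in (("forward", MARKER_7_FORWARD), ("reverse", MARKER_7_REVERSE)):
        print(name, len(primer), round(gc_fraction(primer), 2), wallace_tm(primer))
```

The two estimates differ by only 2 degrees, which is the kind of balance one would look for in a primer pair; more accurate nearest-neighbor estimates would give different absolute values. 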



The kindred described have several features of great interest, the most important being 
that their bones, while very dense, have an absolutely normal shape. The outer dimensions of 
the skeletons of the HBM-affected individuals are normal, and, while medullary cavities are 
present, there is no interference with hemopoiesis. The HBM-affected members seem to be 
resistant to fracture, and there are no neurologic symptoms, and no symptoms of impairment 
of any organ or system function in the members examined. HBM-affected members of the 
kindred live to advanced age without undue illness or disability. Furthermore, the HBM 
phenotype matches no other bone disorders such as osteoporosis, osteoporosis pseudoglioma, 
Engelmann's disease, Ribbing's disease, hyperphosphatemia, Van Buchem's disease, 
melorheostosis, osteopetrosis, pycnodysostosis, sclerostenosis, osteopoikilosis, acromegaly, 
Paget's disease, fibrous dysplasia, tubular stenosis, osteogenesis imperfecta, 
hypoparathyroidism, pseudohypoparathyroidism, pseudopseudohypoparathyroidism, primary 
and secondary hyperparathyroidism and associated syndromes, hypercalciuria, medullary 
carcinoma of the thyroid gland, osteomalacia and other diseases. Clearly, the HBM locus in 
this family has a very powerful and substantial role in regulating bone density, and its 
identification is an important step in understanding the pathway(s) that regulate bone density 
and the pathogenesis of diseases such as osteoporosis. 

In addition, older individuals carrying the HBM gene, and therefore expressing the 
HBM protein, do not show the loss of bone mass characteristic of normal individuals. In other 
words, the HBM gene is a suppressor of osteoporosis. In essence, individuals carrying the 
HBM gene are dosed with the HBM protein, and, as a result, do not develop osteoporosis. 
This in vivo observation is strong evidence that treatment of normal individuals with the 
HBM gene or protein, or a fragment thereof, will ameliorate osteoporosis. 

IV. Physical Mapping 

To provide reagents for the cloning and characterization of the HBM locus, the 
genetic mapping data described above were used to construct a physical map of the region 
containing Zmax1 on chromosome 11q13.3. The physical map consists of an ordered set of 
molecular landmarks, and a set of BAC clones that contain the Zmax1 gene region from 
chromosome 11q13.3. 

Various publicly available mapping resources were utilized to identify existing STS 
markers (Olson et al., Science, 245:1434-1435 (1989)) in the HBM region. Resources 
included the GDB, the Whitehead Institute Genome Center, dbSTS and dbEST (NCBI), 
11db, the University of Texas Southwestern GESTEC, the Stanford Human Genome Center, 
and several literature references (Courseaux et al., Genomics, 40:13-23 (1997), Courseaux et 
al., Genomics, 37:354-365 (1996), Guru et al., Genomics, 42:436-445 (1997), Hosoda et al., 
Genes Cells, 2:345-357 (1997), James et al., Nat. Genet., 8:70-76 (1994), Kitamura et al., DNA 
Research, 4:281-289 (1997), Lemmens et al., Genomics, 44:94-100 (1997), Smith et al., 
Genome Res., 7:835-842 (1997)). Maps were integrated manually to identify markers 
mapping to the region containing Zmax1. 

Primers for existing STSs, obtained from the GDB or literature references, are 
listed in Table 3 below. Thus, Table 3 shows the STS markers used to prepare the physical 
map of the Zmax1 gene region. 

Gene Name 
Actinin, alpha 3 - skeletal muscle 
Pyruvate Carboxylase 
Adenosine Receptor (A2) Gene 
Beta-adrenergic receptor kinase 
sim. to Human endogenous retrovirus mRNA long terminal repeat 
Protein phosphatase 1, catalytic subunit, alpha isoform 
Glutathione S-transferase pi 
NADH dehydrogenase (ubiquinone) flavoprotein 1 (51 kD) 
Aldehyde Dehydrogenase 8 (ALDH8) 
Human ribosomal protein L37 (PSANK1) pseudogene. 
Preprogalanin (GAL1) 
B-cell CLL/lymphoma 1 - Cyclin D1 (PRAD1 gene) 
Cyclin D1 
Fibroblast growth factor 4 
Fibroblast growth factor 3 
Choline Kinase 
Transformation-sensitive protein IEF SSP 3521 
ZNF162 - splicing factor 1 
Mitogen inducible gene (MIG-2) 
Human tat interactive protein (TIP60) 
Calcium activated neutral protease large subunit, muCANP, calpain 
[illegible] 
Human 1.1 kb mRNA upregulated in retinoic acid treated HL-60 neutrophilic ce 
Human pyruvate carboxylase precursor 
LAR-interacting protein 1b 








Reverse Primer 
TTCAGAAGCACTTGGCTGG 
CAAGATCACTCGATCTCCAGG 
TTCTGCAGGTTGCTGTTGAG 
GCCCTCTGTCCTGACTTCAGG 
TGCTTTGTAAAGCACTGAGA 
ATACACCAAGGTCCATGTTCCCCGT 
TCCCGGAGCTTGCACACCCGCTTCA 
CAAGATTCTGTAGCTTCTGG 
ATCCTCTCACATCCCACACT 
TCAGGAGCATTTCATCTTTT 
GCCCTGTGTTCCTTTCAGTA 
AGCTCATGGGGGCTATT 
ATGGCAGAGGACTTAGAACA 
TCCACATTGAGGACTGTGGGAACG 
TTGCACTGTCTTGGATGCA 
CAGGCGCAAAGGACATGCACACGGC 
CAGACAGAGATGCTCCACGCCATAC 
ACACAGTTGCTCTAAAGGGT 
TAGGIGICITAI II 1 1 IGI IGC'I fC 
CAACCCATACCAGGGATAAG 
TGAGGACACAGATACTGATGGG 
GAACTATATTGTAGTTAGTGAGGAG 
TCTTGCTTCCTAAGTTTCTCGG 
TGCTGTTTGCCTCATCTGAC 
TGTTCACTCTTCTGCCTGCAG 
CAAGAGGCTGGTAGAAGGTG 
GGTGGCAGCATGACCTCTAAAG 
CGTGTCCAGATGAAAGTG 
CTTGAAGCCCATCTTTGC 
AATCACTGTGCTTTGTTGCC 
ACTCCCTCGATGGCTTCC 
CCCMCTGGCTTGTTTTATTG 
MGAGTGMCAAAAGCAMCATACC 
TACTGTTCTTGATMGTATGTCGGC 
TCCCCCMAAGAATGTAMGG 
ATCACCCAGGCCAGGGAT 
CCTGCTTGAMGTTCTAGAGCC 
GATGCCAGGACCATGGAC 
CTCTGMGCAGGGACCAGAG 
CAAGCGAMGCTGCCTTC 
[illegible] 
TACGTGGCCMGMGCTAGG 
AGCTTGCAGATGGAGCCC 
TGTTGATCTATACCCTGTTTCCG 
TGGCTGTGAACTTCCTCTGA 
TGAGCTTTAGTTCCCTTCTCTG 
TCTGCGGCTGTTGGATTT 
TGTTCTCTTCTCCCAGCAGG 
TTGTCAMTTCCCCCCAAAA 
CCCTGGAMGGTAAGATGCT 
TATCTGTCTGTAGTGCTTCAAATGT 
ACTGMGMCTCTTGTCCT 
CACTTCTCCCACTTTGTCCC 
TGGCAAGTTAGGCACAGTCA 
TCCACTMGGGCTATGTCGC 
CACTGGAGACTACMGTGGTGG 
[illegible] 
TGAMGGATATTTATAGCCTGGA 
TGAGGGTTGGGAAGATCATA 
CAGCTMCTGTTGACATGCCA 
CMCAGTGCAGTCGGTATCG 


Forward Primer 
CTGGACTACGTGGCCTTCTC 
CTCAGTGCCATGAAGATGGA 
GTTTCAGGAGACTCAGAGTC 
TTATTGTGATTTCCCGTGGC 
GAGAAAGAAATAAGGGGACC 
GAAGTACGGGCAGTTCAGTGGCCT 
AGCCTGGGCCACAGCGTGAGACTAC 
CATGTGCCCACCTCATTCAT 
CAGAGAAGTCAAGGGACTTG 
CAAGGCTAAAAGACGAAAAA 
AAGTCGAGGCTGCAAGGAG 
AAGGTGTGAGGATCACTGG 
GCTTCTCCGAGTGTATCAAC 
GATCAGCGMCTTCCTCTCGGCTC 
GCTAATCACAGTCTAACCGA 
GCACAGCTGTAGTGGGGTTCTAGGC 
CACCGATGAGTGCACGTTCAAGGAG 
TTTCTGGGTGTGTCTGAAT 
CATTTGGGAAATCCAGAAGA 
GACATACCATGAACACTATAAGAGG 
GAACAAGAGGGGTAAGTTGGC 
GAAGTGTTCCCTCTTAAATTCTTTG 
CCTGTAACCCCCAGTCCC 
ACTCCATCCACCTCATCACTG 
GTGGACAGGCATAGCTGAGG 
AGCTGGACTCTCACAGAATG 
GACTCCAGTCTGGGCAATAAAAGC 
CAGGCCCAGTCTCTTG 
ACCTCACGGTGTAATCCC 
TATTTGCAAAGCTTGAGACTTCT 
ACTTTATTGTCAGCGTGGGC 
GAGCAGGGGAGAGAAGGC 
AGCCACTTTATTGTTATTTTGATGC 
GTGGAGTGTGGGATTGGG 
ATGCTTTTGCATGATTCTAATTATT 
CTGGTCTTCCTTGTGTGCTG 
TCAGMGCAGAACTGTTTTTMCA 
CMGCCGGGTTTTATTGAAA 
GCATATAGAMCMTTTATTGCCG 
CTACCACACCACACCAGGC 
GTTGTCTTGACTTCAGGTCTGTC 
GCGTGGGGATATAGAGGTCA 
TMTATATCCCCAGTCTMGGCAT 
TGGTTTTAAACCTTTAATGAGAAM 
MTTATTTAAAAGAGAGGAMGGCA 
GGTTACAGAAAAACATTTGAGAGAT 
TTGAAAMCCATTTATTTCACCG 
TTGAAAMCCATTTATTTCACCG 
CTTTATTGAAAACATTGAGTGCA 
AMCCACGACCNCCAA 
CTTTTGGTAGAGACAAGGTCTCA 
GACGMGGTGATTCAGGGC 
CAGATAAAAGAGTCACTATGGCTCA 
TTATTGATAAGCATTAGTGMCCCC 
CTATGCCCAGAGATGMCAGG 
GCCAGCTTTATTGAGTAMCTTCC 
CATCCCMCCATCACTCAGT 
AGACTACATTTTGGAACCAGTGG 
[illegible] 
CCTTCATAGCCACACCCG 
TCTTTACTGTGCTTACMCTTTCCT 


Size (kb) 
0.164 
0.125 
0.322 
0.117 
0.259 
0.208 
0.191 
0.521 
0.157 
[illegible] 
0.21 
0.287 
0.322 
(blank) 
0.205 
0.248 
0.549 
0.161 
0.22 
0.275 
0.147 
0.167 
0.158 
0.311 
0.166 
0.183 
0.082 
0.237 
0.218 
0.15 
0.126 
0.324 
0.13 
0.151 
[illegible] 
0.125 
0.128 
0.126 
0.199 
0.207 
0.15 
0.13 
0.151 
0.124 
0.127 
0.113 
0.131 
0.209 
0.15 
0.181 
0.15 
0.135 
[illegible] 
0.133 
[illegible] 
0.128 
0.125 
0.223 
0.137 
0.174 
0.15 


GDB Access. 
GDB:197568 
GDB:197884 
GDB:4590179 
GDB:197566 
GDB:270066 
[illegible] 
[illegible] 
GDB:177850 
[illegible] 
[illegible] 
GDB:188627 
[illegible] 
GDB:1222329 
GDB:740600 
GDB:740102 
GDB:740516 
[illegible] 
GDB:196290 
GDB:307895 
GDB:195002 
GDB:611922 
GDB:608115 
GDB:4581644 
[illegible] 
GDB:677652 
[illegible] 
[illegible] 
GDB:678804 
[illegible] 
[illegible] 
GDB:1222237 
[illegible] 
[illegible] 
[illegible] 
GDB:1222235 
GDB:1222257 
GDB:4581874 
[illegible] 
GDB:4576606 
GDB:4581106 
GDB:1222332 
[illegible] 
GDB:4581592 
GDB:4583084 
[illegible] 
GDB:4567265 
[illegible] 
GDB:678546 
[illegible] 
[illegible] 
[illegible] 
[illegible] 


CD 
CL 


Gene 
Gene 
Gene 
Gene 
GENE 
Gene 
Gene 
Gene 
GENE 
EST 
MSAT 
MSAT 
Gene 
VNTR 
Gene 
Gene 
Gene 
Gene 
MSAT 
MSAT 
STS 
STS 
STS 
Gene 
STS 
MSAT 
MSAT 
MSAT 
MSAT 
EST 
EST 
GENE 
EST 
EST 
EST 
EST 
EST 
EST 
[illegible] 
Gene 
EST 
EST 
EST 
EST 
EST 
EST 
EST 
EST 
EST 
MSAT 
EST 
EST 
EST 
EST 
EST 
Gene 
STS 
Gene 
STS 
EST 
EST 


Locus Name 
D11S2161 
D11S1917 
D11S1337 
D11S97 
D11S913 
[illegible] 
D11S4540 
D11S4664 
D11S4327 
D11S703 
D11S1889 
D11S987 
D11S4178 
D11S4113 
D11S4364 
D11S4418 
D11S4407 
D11S4270 
D11S3974 




STS Name 
ACTN3 
PC-B/PC-Y 
(blank) 
ADRBK1 
PSANK3 
[illegible] 
GSTP1.PCR1 
NDUFV1 
PSANK2 
PSANK1 
UT5620 
[illegible] 
GALN 
[illegible] 
[illegible] 
CCND1 
FGF4 
FGF3.PCR1 
AFM164ZF12 
AFMA190YD5 
SHGC-15295 
SHGC-3084 
SHGC-14407 
SHGC-10946 
S515 
AFM147XD10 
AFMA131YE5 
[illegible] 
[illegible] 
WI-17803* 
SGC31923 
WI-7741 
SGC35223 
[illegible] 
WI-6315 
WI-16915 
SGC30608 
WI-17663 
[illegible] 
SGC31567 
SGC30658 
SGC34590 
SGC33927 
W1-B671 
WI-12334 
WI-18402 
WI-18671 
[illegible] 
SGC33767 
AFM343YB5 
SGC33744 
SGC32272 
SGC34148 
WI-18546 
SGC31103 
SGC30028 
WI-2875 
SGC36985 
GCT16B07 
WI-6504 
SGC31049 

Novel STSs were developed either from publicly available genomic sequence or from 
sequence-derived BAC insert ends. Primers were chosen using a script which automatically 
performs vector and repetitive sequence masking using Crossmatch (P. Green, U. of 
Washington) and subsequent primer picking using Primer3 (Rozen, Skaletsky (1996, 1997)). 
Primer3 is available at www.genome.wi.mit.edu/genome_software/other/primer3.html. 

Polymerase chain reaction (PCR) conditions for each primer pair were initially 
optimized with respect to MgCl2 concentration. The standard buffer was 10 mM Tris-HCl 
(pH 8.3), 50 mM KCl, MgCl2, 0.2 mM each dNTP, 0.2 uM each primer, 2.7 ng/ul human 
DNA, 0.25 units of AmpliTaq (Perkin Elmer) and MgCl2 concentrations of 1.0 mM, 1.5 mM, 
2.0 mM or 2.4 mM. Cycling conditions included an initial denaturation at 94°C for 2 minutes 
followed by 40 cycles at 94°C for 15 seconds, 55°C for 25 seconds, and 72°C for 25 seconds 
followed by a final extension at 72°C for 3 minutes. Depending on the results from the initial 
round of optimization the conditions were further optimized if necessary. Variables included 
increasing the annealing temperature to 58°C or 60°C, increasing the cycle number to 42 and 
the annealing and extension times to 30 seconds, and using AmpliTaqGold (Perkin Elmer). 
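
The cycling programs described above can be summarized numerically. The following minimal Python sketch adds up the programmed hold times for the standard program and for the variant with 42 cycles and 30 second annealing and extension steps. It counts hold times only; ramp times between temperatures depend on the thermocycler and are not given, so they are left out. The function name is hypothetical.

```python
def hold_time_seconds(initial_s, cycle_steps_s, cycles, final_s):
    """Total programmed hold time of a PCR program, excluding ramp times."""
    return initial_s + cycles * sum(cycle_steps_s) + final_s

# Standard program: 2 min at 94, then 40 x (15 s, 25 s, 25 s), then 3 min at 72.
STANDARD_S = hold_time_seconds(120, (15, 25, 25), 40, 180)

# Variant: 42 cycles with annealing and extension lengthened to 30 s each.
EXTENDED_S = hold_time_seconds(120, (15, 30, 30), 42, 180)

if __name__ == "__main__":
    for label, total in (("standard", STANDARD_S), ("extended", EXTENDED_S)):
        print(f"{label}: {total} s = {total / 60:.1f} min")
```

The standard program holds for 2900 seconds (about 48 minutes) and the variant for 3450 seconds (about 58 minutes), before ramping is taken into account. 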

BAC clones (Kim et al., Genomics, 32:213-218 (1996), Shizuya et al., Proc. Natl. 
Acad. Sci. USA, 89:8794-8797 (1992)) containing STS markers of interest were obtained by 
PCR-based screening of DNA pools from a total human BAC library purchased from 
Research Genetics. DNA pools derived from library plates 1-596 were used, corresponding to 
nine genomic equivalents of human DNA. The initial screening process involved PCR 
reactions of individual markers against superpools, i.e., a mixture of DNA derived from all 
BAC clones from eight 384-well library plates. For each positive superpool, plate (8), row 
(16) and column (24) pools were screened to identify a unique library address. PCR products 
were electrophoresed in 2% agarose gels (Sigma) containing 0.5 ug/ml ethidium bromide in 
1X TBE at 150 volts for 45 min. The electrophoresis units used were the Model A3-1 
systems from Owl Scientific Products. Typically, gels contained 10 tiers of lanes with 50 
wells/tier. Molecular weight markers (100 bp ladder, Life Technologies, Bethesda, MD) 
were loaded at both ends of the gel. Images of the gels were captured with a Kodak DC40 
CCD camera and processed with Kodak 1D software. The gel data were exported as tab 
delimited text files; names of the files included information about the library screened, the gel 
image files and the marker screened. These data were automatically imported using a 
customized Perl script into Filemaker PRO (Claris Corp.) databases for data storage and 
analysis. In cases where incomplete or ambiguous clone address information was obtained, 
additional experiments were performed to recover a unique, complete library address. 
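
The pooled screening scheme described above (superpools of eight 384-well plates, then 8 plate, 16 row and 24 column pools) can be sketched in Python as follows. The numbering convention is an assumption made for illustration: superpools are taken to group consecutive plates (plates 1-8 in superpool 1, and so on), rows are lettered A-P and columns numbered 1-24. The actual pool layout supplied with the library is not described in the text, and the function names are hypothetical.

```python
import math
import string

LIBRARY_PLATES = 596          # library plates 1-596
PLATES_PER_SUPERPOOL = 8      # eight 384-well plates per superpool
ROWS, COLUMNS = 16, 24        # a 384-well plate is 16 rows x 24 columns

def superpool_count() -> int:
    """Number of superpools needed to cover the library."""
    return math.ceil(LIBRARY_PLATES / PLATES_PER_SUPERPOOL)

def decode_address(superpool: int, plate_pool: int, row_pool: int, column_pool: int):
    """Turn positive pool indices (all 1-based) into (plate number, well name)."""
    if not 1 <= superpool <= superpool_count():
        raise ValueError("superpool out of range")
    if not 1 <= plate_pool <= PLATES_PER_SUPERPOOL:
        raise ValueError("plate pool out of range")
    if not 1 <= row_pool <= ROWS or not 1 <= column_pool <= COLUMNS:
        raise ValueError("row or column pool out of range")
    # Assumed convention: superpool k holds plates 8(k-1)+1 through 8k.
    plate = (superpool - 1) * PLATES_PER_SUPERPOOL + plate_pool
    if plate > LIBRARY_PLATES:
        raise ValueError("plate number beyond the end of the library")
    well = f"{string.ascii_uppercase[row_pool - 1]}{column_pool:02d}"
    return plate, well

def reactions_per_marker(positive_superpools: int = 1) -> int:
    """PCR reactions needed: every superpool, plus 8 + 16 + 24 pools per hit."""
    second_round = PLATES_PER_SUPERPOOL + ROWS + COLUMNS
    return superpool_count() + positive_superpools * second_round

if __name__ == "__main__":
    print(superpool_count())
    print(decode_address(2, 3, 2, 10))
    print(reactions_per_marker(1))
```

With this layout a marker that is positive in a single superpool can be resolved to one well in 123 reactions, rather than one reaction for each of the 596 x 384 wells, which is the point of pooling. An ambiguous result (for example two positive rows) would yield more than one candidate address, consistent with the follow-up experiments mentioned above. 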

Recovery of clonal BAC cultures from the library involved streaking out a sample 
from the library well onto LB agar (Maniatis et al., Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1982)) containing 12.5 
ug/ml chloramphenicol (Sigma). Two individual colonies and a portion of the initial streak 
quadrant were tested with appropriate STS markers by colony PCR for verification. Positive 
clones were stored in LB broth containing 12.5 ug/ml chloramphenicol and 15% glycerol at 
-70°C. 

Several different types of DNA preparation methods were used for isolation of BAC 
DNA. The manual alkaline lysis miniprep protocol listed below (Maniatis et al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, NY 
(1982)) was successfully used for most applications, i.e., restriction mapping, CHEF gel 
analysis, FISH mapping, but was not successfully reproducible in endsequencing. The 
Autogen and Qiagen protocols were used specifically for BAC DNA preparation for 
endsequencing purposes. 

Bacteria were grown in 15 ml Terrific Broth containing 12.5 ug/ml chloramphenicol 
in a 50 ml conical tube at 37°C for 20 hrs with shaking at 300 rpm. The cultures were 
centrifuged in a Sorvall RT 6000 D at 3000 rpm (~1800 g) at 4°C for 15 min. The supernatant 
was then aspirated as completely as possible. In some cases cell pellets were frozen at -20°C 
at this step for up to 2 weeks. The pellet was then vortexed to homogenize the cells and 
minimize clumping. 250 ul of P1 solution (50 mM glucose, 15 mM Tris-HCl, pH 8, 10 mM 
EDTA, and 100 ug/ml RNase A) was added and the mixture pipetted up and down to mix. 
The mixture was then transferred to a 2 ml Eppendorf tube. 350 ul of P2 solution (0.2 N 
NaOH, 1% SDS) was then added, the mixture mixed gently and incubated for 5 min. at room 
temperature. 350 ul of P3 solution (3M KOAc, pH 5.5) was added and the mixture mixed 
gently until a white precipitate formed. The solution was incubated on ice for 5 min. and then 
centrifuged at 4°C in a microfuge for 10 min. The supernatant was transferred carefully 
(avoiding the white precipitate) to a fresh 2 ml Eppendorf tube, and 0.9 ml of isopropanol was 
added, the solution mixed and left on ice for 5 min. The samples were centrifuged for 10 
min., and the supernatant removed carefully. Pellets were washed in 70% ethanol and air 
dried for 5 min. Pellets were resuspended in 200 ul of TE8 (10 mM Tris-HCl, pH 8.0, 1.0 
mM EDTA), and RNase A (Boehringer Mannheim) added to 100 ug/ml. Samples were 
incubated at 37°C for 30 min., then precipitated by addition of C2H3O2Na·3H2O to 0.5 M and 2 
volumes of ethanol. Samples were centrifuged for 10 min., and the pellets washed with 70% 
ethanol followed by air drying and dissolving in 50 ul TE8. Typical yields for this DNA prep 
were 3-5 ug/15 ml bacterial culture. Ten to 15 ul were used for HindIII restriction analysis; 5 
ul was used for NotI digestion and clone insert sizing by CHEF gel electrophoresis. 

BACs were inoculated into 15 ml of 2X LB Broth containing 12.5 ug/ml 
chloramphenicol in a 50 ml conical tube. 4 tubes were inoculated for each clone. Cultures 
were grown overnight (~16 hr) at 37°C with vigorous shaking (>300 rpm). Standard 
conditions for BAC DNA isolation were followed as recommended by the Autogen 740 
manufacturer. 3 ml samples of culture were placed into Autogen tubes for a total of 60 ml or 
20 tubes per clone. Samples were dissolved finally in 100 ul TE8 with 15 seconds of shaking 
as part of the Autogen protocol. After the Autogen protocol was finished DNA solutions 
were transferred from each individual tube and pooled into a 2 ml Eppendorf tube. Tubes 
with large amounts of debris (carry over from the pelleting debris step) were avoided. The 
tubes were then rinsed with 0.5 ml of TE8 successively and this solution added to the pooled 
material. DNA solutions were stored at 4°C; clumping tended to occur upon freezing at -20°C. 
This DNA was either used directly for restriction mapping, CHEF gel analysis or FISH 
mapping or was further purified as described below for use in endsequencing reactions. 

The volume of DNA solutions was adjusted to 2 ml with TE8, samples were then 
mixed gently and heated at 65°C for 10 min. The DNA solutions were then centrifuged at 4°C 
for 5 min. and the supernatants transferred to a 15 ml conical tube. The NaCl concentration 
was then adjusted to 0.75 M (~0.3 ml of 5 M NaCl to the 2 ml sample). The total volume 
was then adjusted to 6 ml with Qiagen column equilibration buffer (Buffer QBT). The 
supernatant containing the DNA was then applied to the column and allowed to enter by 
gravity flow. Columns were washed twice with 10 ml of Qiagen Buffer QC. Bound DNA 
was then eluted with four separate 1 ml aliquots of Buffer QF kept at 65°C. DNA was 
precipitated with 0.7 volumes of isopropanol (~2.8 ml). Each sample was then transferred to 
4 individual 2.2 ml Eppendorf tubes and incubated at room temperature for 2 hr or overnight. 
Samples were centrifuged in a microfuge for 10 min. at 4°C. The supernatant was removed 
carefully and 1 ml of 70% ethanol was added. Samples were centrifuged again and, because 
the DNA pellets were often loose at this stage, the supernatant was removed carefully. Samples 
were centrifuged again to concentrate remaining liquid, which was removed with a micropipet 
tip. DNA pellets were then dried in a desiccator for 10 min. 20 ul of sterile distilled and 
deionized H2O was added to each tube, which was then placed at 4°C overnight. The four 20 
ul samples for each clone were pooled and the tubes rinsed with another 20 ul of sterile 
distilled and deionized H2O for a final volume of 100 ul. Samples were then heated at 65°C 
for 5 min. and then mixed gently. Typical yields were 2-5 ng/60 ml culture as assessed by 
NotI digestion and comparison with uncut lambda DNA. 
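
The volumes quoted in the column purification above follow from simple dilution arithmetic, sketched here in Python. Two points are assumptions of the sketch rather than statements from the protocol: that the quoted 0.3 ml of 5 M NaCl comes from the common shortcut of ignoring the added volume, and that the isopropanol volume is based on a 4 ml eluate (four 1 ml aliquots). The function names are hypothetical.

```python
def stock_volume_ignoring_dilution(c_stock, c_target, v_sample):
    """Stock volume from C1*V1 = C2*V2, ignoring the volume that is added."""
    return c_target * v_sample / c_stock

def stock_volume_exact(c_stock, c_target, v_sample):
    """Stock volume that gives c_target once the added volume is counted."""
    return c_target * v_sample / (c_stock - c_target)

def final_concentration(c_stock, v_stock, v_sample):
    """Concentration reached after adding v_stock of stock to v_sample."""
    return c_stock * v_stock / (v_sample + v_stock)

# 5 M NaCl stock, 0.75 M target, 2 ml sample (volumes in ml).
SHORTCUT_ML = stock_volume_ignoring_dilution(5.0, 0.75, 2.0)
EXACT_ML = stock_volume_exact(5.0, 0.75, 2.0)
REACHED_M = final_concentration(5.0, 0.3, 2.0)

# 0.7 volumes of isopropanol for an assumed 4 ml eluate.
ISOPROPANOL_ML = 0.7 * 4.0

if __name__ == "__main__":
    print(f"shortcut: {SHORTCUT_ML:.2f} ml, exact: {EXACT_ML:.2f} ml")
    print(f"0.3 ml added gives about {REACHED_M:.2f} M NaCl")
    print(f"isopropanol: {ISOPROPANOL_ML:.1f} ml")
```

The shortcut reproduces the quoted 0.3 ml, while counting the added volume gives roughly 0.35 ml, so the quoted addition brings the sample to about 0.65 M before it is diluted to 6 ml with equilibration buffer. 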

3 ml of LB Broth containing 12.5 ug/ml of chloramphenicol was dispensed into 
autoclaved Autogen tubes. A single tube was used for each clone. For inoculation, glycerol 
stocks were removed from -70°C storage and placed on dry ice. A small portion of the 
glycerol stock was removed from the original tube with a sterile toothpick and transferred 
into the Autogen tube; the toothpick was left in the Autogen tube for at least two minutes 
before discarding. After inoculation the tubes were covered with tape making sure the seal 
was tight. When all samples were inoculated, the tube units were transferred into an Autogen 
rack holder and placed into a rotary shaker at 37°C for 16-17 hours at 250 rpm. Following 
growth, standard conditions for BAC DNA preparation, as defined by the manufacturer, were 
used to program the Autogen. Samples were not dissolved in TE8 as part of the program and 
DNA pellets were left dry. When the program was complete, the tubes were removed from 
the output tray and 30 ul of sterile distilled and deionized H2O was added directly to the 
bottom of the tube. The tubes were then gently shaken for 2-5 seconds and then covered with 
parafilm and incubated at room temperature for 1-3 hours. DNA samples were then 
transferred to an Eppendorf tube and used either directly for sequencing or stored at 4°C for 
later use. 

V. BAC Clone Characterization for Physical Mapping 

DNA samples prepared either by manual alkaline lysis or the Autogen protocol were 
digested with HindIII for analysis of restriction fragment sizes. These data were used to 
compare the extent of overlap among clones. Typically 1-2 ug were used for each reaction. 
Reaction mixtures included: 1X Buffer 2 (New England Biolabs), 0.1 mg/ml bovine serum 
albumin (New England Biolabs), 50 ug/ml RNase A (Boehringer Mannheim), and 20 units of 
HindIII (New England Biolabs) in a final volume of 25 ul. Digestions were incubated at 37°C 
for 4-6 hours. BAC DNA was also digested with NotI for estimation of insert size by CHEF 
gel analysis (see below). Reaction conditions were identical to those for HindIII except that 
20 units of NotI were used. Six ul of 6X Ficoll loading buffer containing bromphenol blue 
and xylene cyanol was added prior to electrophoresis. 

HindIII digests were analyzed on 0.6% agarose (Seakem, FMC Bioproducts) in 1X 
TBE containing 0.5 ug/ml ethidium bromide. Gels (20 cm X 25 cm) were electrophoresed in 
a Model A4 electrophoresis unit (Owl Scientific) at 50 volts for 20-24 hrs. Molecular weight 
size markers included undigested lambda DNA, HindIII digested lambda DNA, and HaeIII 
digested φX174 DNA. Molecular weight markers were heated at 65°C for 2 min. prior to 
loading the gel. Images were captured with a Kodak DC40 CCD camera and analyzed with 
Kodak 1D software. 

NotI digests were analyzed on a CHEF DRII (BioRad) electrophoresis unit according 
to the manufacturer's recommendations. Briefly, 1% agarose gels (BioRad pulsed field 
grade) were prepared in 0.5X TBE, equilibrated for 30 minutes in the electrophoresis unit at 
14°C, and electrophoresed at 6 volts/cm for 14 hrs with circulation. Switching times were 
ramped from 10 sec to 20 sec. Gels were stained after electrophoresis in 0.5 ug/ml ethidium 
bromide. Molecular weight markers included undigested lambda DNA, HindIII digested 
lambda DNA, lambda ladder PFG ladder, and low range PFG marker (all from New England 
Biolabs). 

BAC DNA prepared either by the manual alkaline lysis or Autogen protocols was 
labeled for FISH analysis using a Bioprime labeling kit (BioRad) according to the 
manufacturer's recommendation with minor modifications. Approximately 200 ng of DNA 
was used for each 50 ul reaction. 3 ul were analyzed on a 2% agarose gel to determine the 
extent of labeling. Reactions were purified using a Sephadex G50 spin column prior to in situ 
hybridization. Metaphase FISH was performed as described (Ma et al., Cytogenet. Cell 
Genet., 74:266-271 (1996)). 
VI. BAC Endsequencing 

The sequencing of BAC insert ends utilized DNA prepared by either of the two 
methods described above. The DYEnamic energy transfer primers and DYEnamic Direct cycle 
sequencing kits from Amersham were used for sequencing reactions. Ready made 
sequencing mix including the M13 -40 forward sequencing primer was used (Catalog # 
US79730) for the T7 BAC vector terminus; ready made sequencing mix (Catalog # 
US79530) was mixed with the M13 -28 reverse sequencing primer (Catalog # US79339) for 
the SP6 BAC vector terminus. The sequencing reaction mixes included one of the four 
fluorescently labeled dye-primers, one of the four dideoxy termination mixes, dNTPs, 
reaction buffer, and Thermosequenase. For each BAC DNA sample, 3 ul of the BAC DNA 
sample was aliquoted to 4 PCR strip tubes. 2 ul of one of the four dye primer/termination 
mix combinations was then added to each of the four tubes. The tubes were then sealed and 
centrifuged briefly prior to PCR. Thermocycling conditions involved a 1 minute denaturation 
at 95°C, 15 second annealing at 45°C, and extension for 1 minute at 70°C for 35 total cycles. 
After cycling the plates were centrifuged briefly to collect all the liquid to the bottom of the 
tubes. 5 ul of sterile distilled and deionized H2O was then added into each tube, the plates 
sealed and centrifuged briefly again. The four samples for each BAC were then pooled 
together. DNA was then precipitated by adding 1.5 ul of 7.5 M NH4OAc and 100 ul of -20°C 
100% ethanol to each tube. Samples were mixed by pipetting up and down once. The plates 
were then sealed and incubated on ice for 10 minutes. Plates were centrifuged in a table top 
Heraeus centrifuge at 4000 rpm (3,290 g) for 30 minutes at 4°C to recover the DNA. The 
supernatant was removed and excess liquid blotted onto paper towels. Pellets were washed 
by adding 100 ul of -20°C 70% ethanol into each tube and recentrifuging at 4000 rpm (3,290 
g) for 10 minutes at 4°C. The supernatant was removed and excess liquid again removed by 
blotting on a paper towel. Remaining traces of liquid were removed by placing the plates 
upside down over a paper towel and centrifuging only until the centrifuge reached 800 rpm. 
Samples were then air dried at room temperature for 30 min. Tubes were capped and stored 
dry at -20°C until electrophoresis. Immediately prior to electrophoresis the DNA was 
dissolved in 1.5 ul of Amersham loading dye. Plates were then sealed and centrifuged at 
2000 rpm (825 g). The plates were then vortexed on a plate shaker for 1-2 minutes. Samples 
were then recentrifuged at 2000 rpm (825 g) briefly. Samples were then heated at 65°C for 2 
min. and immediately placed on ice. Standard gel electrophoresis was performed on ABI 377 
fluorescent sequencers according to the manufacturer's recommendation. 
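
The rpm and g values quoted for the centrifugation steps can be related through the standard relative centrifugal force formula, RCF = 1.118e-5 x r x rpm^2, with the rotor radius r in centimeters. The following minimal Python sketch back-calculates the rotor radius implied by each quoted pair, which is what another laboratory would need in order to reproduce the same g force on a different rotor. The rotor radii are not stated in the protocols; the values computed here are inferences from the quoted numbers, and the function names are hypothetical.

```python
RCF_CONSTANT = 1.118e-5  # standard constant for radius in cm and speed in rpm

def rcf(rpm: float, radius_cm: float) -> float:
    """Relative centrifugal force (in g) at a given speed and rotor radius."""
    return RCF_CONSTANT * radius_cm * rpm ** 2

def implied_radius_cm(rpm: float, rcf_g: float) -> float:
    """Rotor radius implied by a quoted (rpm, g) pair."""
    return rcf_g / (RCF_CONSTANT * rpm ** 2)

# (rpm, g) pairs quoted in the protocols.
QUOTED = {
    "table top centrifuge, 4000 rpm": (4000, 3290),
    "table top centrifuge, 2000 rpm": (2000, 825),
    "Sorvall RT 6000 D, 3000 rpm": (3000, 1800),  # quoted as approximately 1800 g
}

if __name__ == "__main__":
    for label, (rpm, g) in QUOTED.items():
        print(f"{label}: implied radius {implied_radius_cm(rpm, g):.1f} cm")
```

The two table top centrifuge values imply nearly the same radius (about 18.4 cm), which suggests the quoted g forces are internally consistent. 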
VII. Sub-cloning and Sequencing of HBM BAC DNA 

The physical map of the Zmax1 gene region provides a set of BAC clones that contain 
within them the Zmax1 gene and the HBM gene. DNA sequencing of several of the BACs 
from the region has been completed. The DNA sequence data is a unique reagent that 
includes data that one skilled in the art can use to identify the Zmax1 gene and the HBM 
gene, or to prepare probes to identify the gene(s), or to identify DNA sequence 
polymorphisms that identify the gene(s). 

BAC DNA was isolated according to one of two protocols, either a Qiagen 
purification of BAC DNA (Qiagen, Inc. as described in the product literature) or a manual 
purification which is a modification of the standard alkaline lysis/Cesium Chloride 
preparation of plasmid DNA (see e.g., Ausubel et al, Current Protocols in Molecular Biology, 
John Wiley & Sons (1997)). Briefly for the manual protocol, cells were pelleted, 
resuspended in GTE (50 mM glucose, 25 mM Tris-Cl (pH 8), 10 mM EDTA) and lysozyme 
(50 mg/ml solution), followed by NaOH/SDS (1% SDS/0.2 N NaOH) and then an ice-cold 
solution of 3M KOAc (pH 4.5-4.8). RNase A was added to the filtered supernatant, followed 
by Proteinase K and 20% SDS. The DNA was then precipitated with isopropanol, dried and 
resuspended in TE (10 mM Tris, 1 mM EDTA (pH 8.0)). The BAC DNA was further 
purified by Cesium Chloride density gradient centrifugation (Ausubel et al, Current 
Protocols in Molecular Biology, John Wiley & Sons (1997)). 

Following isolation, the BAC DNA was sheared hydrodynamically using an HPLC 
(Hengen, Trends in Biochem. Sci., 22:273-274 (1997)) to an insert size of 2000-3000 bp. 
After shearing, the DNA was concentrated and separated on a standard 1% agarose gel. A 
single fraction, corresponding to the approximate size, was excised from the gel and purified 
by electroelution (Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring 
Harbor Laboratory, Cold Spring Harbor, NY (1989)). 

The purified DNA fragments were then blunt-ended using T4 DNA polymerase. The 
blunt-ended DNA was then ligated to unique BstXI-linker adapters (5' 
GTCTTCACCACGGGG and 5' GTGGTGAAGAC in 100-1000 fold molar excess). These 
linkers were complementary to the BstXI-cut pMPX vectors (constructed by the inventors), 
while the overhang was not self-complementary. Therefore, the linkers would not 
concatemerize nor would the cut-vector religate itself easily. The linker-adapted inserts were 
separated from the unincorporated linkers on a 1% agarose gel and purified using GeneClean 
(BIO 101, Inc.). The linker-adapted insert was then ligated to a modified pBlueScript vector 
to construct a "shotgun" subclone library. The vector contained an out-of-frame lacZ gene at 
the cloning site which became in-frame in the event that an adapter-dimer was cloned, allowing 
such clones to be avoided by their blue color. 
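
The statement that the linker overhang is not self-complementary can be checked directly from the two adapter sequences given above. The sketch below assumes (as the sequences suggest) that the short oligo anneals to the 5' portion of the long oligo, leaving the remaining bases single-stranded; the function names are illustrative and not part of the original protocol.

```python
# Sketch: derive the single-stranded overhang of the annealed BstXI
# linker-adapter and test whether it could pair with itself.

COMPLEMENT = str.maketrans("ACGT", "TGCA")

LONG_OLIGO = "GTCTTCACCACGGGG"   # 5'->3', from the text
SHORT_OLIGO = "GTGGTGAAGAC"      # 5'->3', from the text


def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def single_stranded_overhang(long_oligo: str, short_oligo: str) -> str:
    """Bases of the long oligo left unpaired after annealing.

    Assumes the short oligo pairs with the 5' end of the long oligo.
    """
    paired = reverse_complement(short_oligo)
    if not long_oligo.startswith(paired):
        raise ValueError("short oligo does not pair with the 5' end")
    return long_oligo[len(paired):]


def is_self_complementary(seq: str) -> bool:
    """True if two copies of the overhang could anneal to each other."""
    return seq == reverse_complement(seq)


overhang = single_stranded_overhang(LONG_OLIGO, SHORT_OLIGO)
can_concatemerize = is_self_complementary(overhang)
```

Under this assumption the overhang is a run of four G residues, whose reverse complement is four C residues, so two adapted ends cannot pair with one another, consistent with the text.
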

All subsequent steps were based on sequencing by ABI377 automated DNA 
sequencing methods. Only major modifications to the protocols are highlighted. Briefly, the 
library was then transformed into DH5 competent cells (Life Technologies, Bethesda, MD, 
DH5 transformation protocol). The transformation was assessed by plating onto antibiotic plates containing 
ampicillin and IPTG/Xgal. The plates were incubated overnight at 37 C. Successful 
transformants were then used for plating of clones and picking for sequencing. The cultures 
were grown overnight at 37 C. DNA was purified using a silica bead DNA preparation method (Ng et 
al, Nucl. Acids Res., 24:5045-5047 (1996)). In this manner, 25 ng of DNA was 
obtained per clone. 

These purified DNA samples were then sequenced using ABI dye-terminator 
chemistry. The ABI dye terminator sequence reads were run on ABI377 machines and the 
data was directly transferred to UNIX machines following lane tracking of the gels. All reads 
were assembled using PHRAP (P. Green, Abstracts of DOE Human Genome Program 
Contractor-Grantee Workshop V, Jan. 1996, p.157) with default parameters and quality 
scores. The initial assembly was done at 6-fold coverage and yielded an average of 8-15 
contigs. Following the initial assembly, missing mates (sequences from clones that only 
gave one strand reads) were identified and sequenced with ABI technology to allow the 
identification of additional overlapping contigs. Primers for walking were selected using a 
Genome Therapeutics program Pick_primer near the ends of the clones to facilitate gap 
closure. These walks were sequenced using the selected clones and primers. Data were 
reassembled with PHRAP into sequence contigs. 
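
For orientation, the read count implied by 6-fold shotgun coverage can be estimated with simple arithmetic. The sketch below is illustrative only: the BAC insert size (150 kb) and the usable read length (500 bp) are hypothetical values not stated in the text, and the uncovered fraction uses the idealized Poisson model of random shotgun sampling.

```python
import math

# Sketch: reads needed for a target coverage, and the fraction of bases
# expected to remain unsampled under a Poisson model of random reads.
# Insert size and read length used below are HYPOTHETICAL assumptions.


def reads_for_coverage(insert_bp: int, read_bp: int, coverage: float) -> int:
    """Number of reads needed so that total read length = coverage x insert."""
    return math.ceil(coverage * insert_bp / read_bp)


def expected_uncovered_fraction(coverage: float) -> float:
    """Poisson estimate of the fraction of bases hit by no read."""
    return math.exp(-coverage)


reads_needed = reads_for_coverage(insert_bp=150_000, read_bp=500, coverage=6)
uncovered = expected_uncovered_fraction(6)
```

Even at 6-fold coverage the model leaves a small fraction of bases unsampled, which is consistent with the need for missing-mate sequencing and primer walks to close gaps.
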
VIII. Gene Identification by Computational Methods 

Following assembly of the BAC sequences into contigs, the contigs were subjected to 
computational analyses to identify coding regions and regions bearing DNA sequence 
similarity to known genes. This protocol included the following steps. 

1. Degap the contigs: the sequence contigs often contain symbols (denoted by a 
period symbol) that represent locations where the individual ABI sequence reads have 
insertions or deletions. Prior to automated computational analysis of the contigs, the periods 
were removed. The original data was maintained for future reference. 

2. BAC vector sequences were "masked" within the sequence by using the 
program cross_match (Phil Green, http://chimera.biotech.washington.edu/UWGC). Since the 
shotgun library construction detailed above leaves some BAC vector in the shotgun 
libraries, this program was used to compare the sequence of the BAC contigs to the BAC 
vector and to mask any vector sequence prior to subsequent steps. Masked sequences were 
marked by an "X" in the sequence files, and remained inert during subsequent analyses. 

3. E. coli sequences contaminating the BAC sequences were masked by 
comparing the BAC contigs to the entire E. coli DNA sequence. 

4. Repetitive elements known to be common in the human genome were masked 
using cross_match. In this implementation of cross_match, the BAC sequence was compared 
to a database of human repetitive elements (Jerzy Jurka, Genetic Information Research 
Institute, Palo Alto, CA). The masked repeats were marked by X and remained inert during 
subsequent analyses. 

5. The location of exons within the sequence was predicted using the MZEF 
computer program (Zhang, Proc. Natl. Acad. Sci., 94:565-568 (1997)). 

6. The sequence was compared to the publicly available UniGene database 
(National Center for Biotechnology Information, National Library of Medicine, 38A, 8N905, 
8600 Rockville Pike, Bethesda, MD 20894; www.ncbi.nlm.nih.gov) using the blastn2 
algorithm (Altschul et al, Nucl. Acids Res., 25:3389-3402 (1997)). The parameters for this 
search were: E=0.05, V=50, B=50 (where E is the expected probability score cutoff, V is the 
number of database entries returned in the reporting of the results, and B is the number of 
sequence alignments returned in the reporting of the results (Altschul et al, J. Mol. Biol., 
215:403-410 (1990))). 

7. The sequence was translated into protein for all six reading frames, and the 
protein sequences were compared to a non-redundant protein database compiled from 
Genpept, Swissprot and PIR (National Center for Biotechnology Information, National Library of 
Medicine, 38A, 8N905, 8600 Rockville Pike, Bethesda, MD 20894; www.ncbi.nlm.nih.gov). 
The parameters for this search were E=0.05, V=50, B=50, where E, V, and B are defined as 
above. 

8. The BAC DNA sequence was compared to the database of the cDNA clones 
derived from direct selection experiments (described below) using blastn2 (Altschul et al, 
Nucl. Acids. Res., 25:3389-3402 (1997)). The parameters for this search were E=0.05, V=250, 
B=250, where E, V, and B are defined as above. 

9. The BAC sequence was compared to the sequences of all other BACs from the 
HBM region on chromosome 11q12-13 using blastn2 (Altschul et al, Nucl. Acids. Res., 
25:3389-3402 (1997)). The parameters for this search were E=0.05, V=50, B=50, where E, V, 
and B are defined as above. 

10. The BAC sequence was compared to the sequences derived from the ends of 
BACs from the HBM region on chromosome 11q12-13 using blastn2 (Altschul et al, Nucl. 
Acids. Res., 25:3389-3402 (1997)). The parameters for this search were E=0.05, V=50, B=50, 
where E, V, and B are defined as above. 

11. The BAC sequence was compared to the Genbank database (National Center 
for Biotechnology Information, National Library of Medicine, 38A, 8N905, 8600 Rockville 
Pike, Bethesda, MD 20894; www.ncbi.nlm.nih.gov) using blastn2 (Altschul et al, Nucl. 
Acids. Res., 25:3389-3402 (1997)). The parameters for this search were E=0.05, V=50, B=50, 
where E, V, and B are defined as above. 

12. The BAC sequence was compared to the STS division of Genbank database 
(National Center for Biotechnology Information, National Library of Medicine, 38A, 8N905, 
8600 Rockville Pike, Bethesda, MD 20894; www.ncbi.nlm.nih.gov) using blastn2 (Altschul 
et al, 1997). The parameters for this search were E=0.05, V=50, B=50, where E, V, and B are 
defined as above. 

13. The BAC sequence was compared to the Expressed Sequence Tag (EST) 
Genbank database (National Center for Biotechnology Information, National Library of 
Medicine, 38A, 8N905, 8600 Rockville Pike, Bethesda, MD 20894; www.ncbi.nlm.nih.gov) 
using blastn2 (Altschul et al, Nucl. Acids. Res., 25:3389-3402 (1997)). The parameters for 
this search were E=0.05, V=250, B=250, where E, V, and B are defined as above. 
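
The pre-processing in steps 1 through 4 and the search parameters of steps 6 through 13 can be summarized in a short sketch. This is not the inventors' software: the masking intervals would in practice come from cross_match output, whereas here they are passed in as plain (start, end) pairs, and all function and dictionary names are illustrative assumptions.

```python
from dataclasses import dataclass

# Sketch of contig clean-up and of the per-database search parameters
# described above. Names are hypothetical; intervals are supplied by hand.


def degap(contig: str) -> str:
    """Step 1: remove the period symbols marking read insertions/deletions."""
    return contig.replace(".", "")


def mask(sequence: str, intervals) -> str:
    """Steps 2-4: overwrite matched regions with 'X' so they stay inert.

    `intervals` holds 0-based, end-exclusive (start, end) pairs.
    """
    chars = list(sequence)
    for start, end in intervals:
        for i in range(max(start, 0), min(end, len(chars))):
            chars[i] = "X"
    return "".join(chars)


@dataclass(frozen=True)
class SearchParams:
    E: float = 0.05  # expected probability score cutoff
    V: int = 50      # database entries reported
    B: int = 50      # alignments reported


# One entry per comparison in steps 6-13; only two use V=B=250.
SEARCHES = {
    "unigene": SearchParams(),
    "protein_nr_six_frame": SearchParams(),
    "direct_selected_cdna": SearchParams(V=250, B=250),
    "hbm_region_bacs": SearchParams(),
    "hbm_bac_ends": SearchParams(),
    "genbank": SearchParams(),
    "genbank_sts": SearchParams(),
    "genbank_est": SearchParams(V=250, B=250),
}

cleaned = mask(degap("ACG.TAC..GT"), [(2, 5)])
```

Keeping the original contig alongside the degapped, masked copy mirrors the note in step 1 that the original data was maintained for future reference.
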

IX. Gene Identification by Direct cDNA Selection 

Primary linkered cDNA pools were prepared from bone marrow, calvarial bone, 
femoral bone, kidney, skeletal muscle, testis and total brain. Poly(A)+ RNA was prepared 
from calvarial and femoral bone tissue (Chomczynski et al, Anal. Biochem., 162:156-159 
(1987); D'Alessio et al, Focus, 9:1-4 (1987)) and the remainder of the mRNA was purchased 
from Clontech (Palo Alto, California). In order to generate oligo(dT) and random primed 
cDNA pools from the same tissue, 2.5 ug mRNA was mixed with oligo(dT) primer in one 
reaction and 2.5 ug mRNA was mixed with random hexamers in another reaction, and both 
were converted to first and second strand cDNA according to the manufacturer's 
recommendations (Life Technologies, Bethesda, MD). Paired phosphorylated cDNA linkers 
(see sequence below) were annealed together by mixing in a 1:1 ratio (10 ug each), incubated 
at 65 C for five minutes and allowed to cool to room temperature. 

Paired linkers oligo1/2 

OLIGO 1: 5'CTG AGC GGA ATT CGT GAG ACC3' (SEQ ID NO:12) 
OLIGO 2: 5'TTG GTC TCA CGT ATT CCG CTC GA3' (SEQ ID NO:13) 

Paired linkers oligo3/4 

OLIGO 3: 5'CTC GAG AAT TCT GGA TCC TC3' (SEQ ID NO:14) 
OLIGO 4: 5'TTG AGG ATC CAG AAT TCT CGA G3' (SEQ ID NO:15) 

Paired linkers oligo5/6 

OLIGO 5: 5'TGT ATG CGA ATT CGC TGC GCG3' (SEQ ID NO:16) 
OLIGO 6: 5'TTC GCG CAG CGA ATT CGC ATA CA3' (SEQ ID NO:17) 

Paired linkers oligo7/8 

OLIGO 7: 5'GTC CAC TGA ATT CTC AGT GAG3' (SEQ ID NO:18) 
OLIGO 8: 5'TTG TCA CTG AGA ATT CAG TGG AC3' (SEQ ID NO:19) 

Paired linkers oligo11/12 

OLIGO 11: 5'GAA TCC GAA TTC CTG GTC AGC3' (SEQ ID NO:20) 
OLIGO 12: 5'TTG CTG ACC AGG AAT TCG GAT TC3' (SEQ ID NO:21) 

Linkers were ligated to all oligo(dT) and random primed cDNA pools (see below) according 
to the manufacturer's instructions (Life Technologies, Bethesda, MD). 

Oligo 1/2 was ligated to oligo(dT) and random primed cDNA pools prepared from 
bone marrow. Oligo 3/4 was ligated to oligo(dT) and random primed cDNA pools prepared 
from calvarial bone. Oligo 5/6 was ligated to oligo(dT) and random primed cDNA pools 
prepared from brain and skeletal muscle. Oligo 7/8 was ligated to oligo(dT) and random 
primed cDNA pools prepared from kidney. Oligo 11/12 was ligated to oligo(dT) and random 
primed cDNA pools prepared from femoral bone. 

The cDNA pools were evaluated for length distribution by PCR amplification using 1 
ul of a 1:1, 1:10, and 1:100 dilution of the ligation reaction, respectively. PCR reactions were 
performed in a Perkin Elmer 9600; each 25 ul volume reaction contained 1 ul of DNA, 10 
mM Tris-HCl (pH 8.3), 50 mM KCl, 1.5 mM MgCl2, 0.001% gelatin, 200 mM each dNTPs, 
10 uM primer and 1 unit Taq DNA polymerase (Perkin Elmer) and was amplified under the 
following conditions: 30 seconds at 94 C, 30 seconds at 60 C and 2 minutes at 72 C for 30 
cycles. The length distribution of the amplified cDNA pools was evaluated by 
electrophoresis on a 1% agarose gel. The PCR reaction that gave the best representation of 
the random primed and oligo(dT) primed cDNA pools was scaled up so that ~2-3 ug of each 
cDNA pool was produced. The starting cDNA for the direct selection reaction comprised 
0.5 ug of random primed cDNAs mixed with 0.5 ug of oligo(dT) primed cDNAs. 

The DNA from the 54 BACs that were used in the direct cDNA selection procedure 
was isolated using Nucleobond AX columns as described by the manufacturer (The Nest 
Group, Inc.). 

The BACs were pooled in equimolar amounts and 1 ug of the isolated genomic DNA 
was labelled with biotin 16-UTP by nick translation in accordance with the manufacturer's 
instructions (Boehringer Mannheim). The incorporation of the biotin was monitored by 
methods that could be practiced by one skilled in the art (Del Mastro and Lovett, Methods in 
Molecular Biology, Humana Press Inc., NJ (1996)). 

Direct cDNA selection was performed using methods that could be practiced by one 
skilled in the art (Del Mastro and Lovett, Methods in Molecular Biology, Humana Press Inc., 
NJ (1996)). Briefly, the cDNA pools were multiplexed in two separate reactions: In one 
reaction cDNA pools from bone marrow, calvarial bone, brain and testis were mixed, and in 
the other cDNA pools from skeletal muscle, kidney and femoral bone were mixed. 
Suppression of the repeats, yeast sequences and plasmid in the cDNA pools was performed to 
a Cot of 20. 100 ng of biotinylated BAC DNA was mixed with the suppressed cDNAs and 
hybridized in solution to a Cot of 200. The biotinylated DNA and the cognate cDNAs were 
captured on streptavidin-coated paramagnetic beads. The beads were washed and the primary 
selected cDNAs were eluted. These cDNAs were PCR amplified and a second round of 
direct selection was performed. The product of the second round of direct selection is 
referred to as the secondary selected material. A Galanin cDNA clone, previously shown to 
map to 11q12-13 (Evans, Genomics, 18:473-477 (1993)), was used to monitor enrichment 
during the two rounds of selection. 

The secondary selected material from bone marrow, calvarial bone, femoral bone, 
kidney, skeletal muscle, testis and total brain was PCR amplified using modified primers of 
oligos 1, 3, 5, 7 and 11, shown below, and cloned into the UDG vector pAMP10 (Life 
Technologies, Bethesda, MD), in accordance with the manufacturer's recommendations. 

Modified primer sequences: 

Oligo1-CUA: 5'CUA CUA CUA CUA CTG AGC GGA ATT CGT GAG ACC3' (SEQ ID NO:22) 

Oligo3-CUA: 5'CUA CUA CUA CUA CTC GAG AAT TCT GGA TCC TC3' (SEQ ID NO:23) 

Oligo5-CUA: 5'CUA CUA CUA CUA TGT ATG CGA ATT CGC TGC GCG3' (SEQ ID NO:24) 

Oligo7-CUA: 5'CUA CUA CUA CUA GTC CAC TGA ATT CTC AGT GAG3' (SEQ ID NO:25) 

Oligo11-CUA: 5'CUA CUA CUA CUA GAA TCC GAA TTC CTG GTC AGC3' (SEQ ID NO:26) 

The cloned secondary selected material, from each tissue source, was transformed into 
MAX Efficiency DH5α Competent Cells (Life Technologies, Bethesda, MD) as 
recommended by the manufacturer. 384 colonies were picked from each transformed source 
and arrayed into four 96 well microtiter plates. 
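
Arraying 384 picked colonies into four 96-well plates amounts to a simple index-to-address mapping. The sketch below assumes row-major filling (A1 to A12, then B1, and so on) and consecutive plate numbering; the text does not specify the fill order, so both are assumptions.

```python
# Sketch: map a picked colony (0-based index, 0..383) to a plate number
# and well name across four 96-well plates. Fill order is an ASSUMPTION.

ROWS = "ABCDEFGH"      # 8 rows per 96-well plate
COLUMNS = 12           # 12 columns per plate
WELLS_PER_PLATE = len(ROWS) * COLUMNS
COLONIES_PER_SOURCE = 384


def well_for_colony(index: int):
    """Return (plate number 1-4, well name such as 'A1') for a colony."""
    if not 0 <= index < COLONIES_PER_SOURCE:
        raise ValueError("colony index must be in 0..383")
    plate, offset = divmod(index, WELLS_PER_PLATE)
    row, column = divmod(offset, COLUMNS)
    return plate + 1, f"{ROWS[row]}{column + 1}"
```

For example, under these assumptions the 97th colony picked (index 96) lands in well A1 of the second plate.
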

All secondarily selected cDNA clones were sequenced using an M13 dye primer 
terminator cycle sequencing kit (Applied Biosystems), and the data were collected by the ABI 377 
automated fluorescence sequencer (Applied Biosystems). 

All sequences were analyzed using the BLASTN, BLASTX and FASTA programs 
(Altschul et al, J. Mol. Biol, 215:403-410 (1990), Altschul et al, Nucl. Acids. Res., 25:3389- 
3402 (1997)). The cDNA sequences were compared to a database containing sequences 
derived from human repeats, mitochondrial DNA, ribosomal RNA, and E. coli DNA to remove 
background clones from the dataset using the program crossmatch. A further round of 
comparison was also performed using the program BLASTN2 against known genes 
(Genbank) and the BAC sequences from the HBM region. Those cDNAs that were >90% 
homologous to these sequences were filed according to the result and the data stored in a 
database for further analysis. cDNA sequences that were identified but did not have 
significant similarity to the BAC sequences from the HBM region or were eliminated by 
crossmatch were hybridized to nylon membranes which contained the BACs from the HBM 
region, to ascertain whether they hybridized to the target. 

Hybridization analysis was used to map the cDNA clones to the BAC target that 
selected them. The BACs that were identified from the HBM region were arrayed and grown 
into a 96 well microtiter plate. LB agar containing 25 ug/ml kanamycin was poured into 96 
well microtiter plate lids. Once the agar had solidified, pre-cut Hybond N+ nylon membranes 
(Amersham) were laid on top of the agar and the BACs were stamped onto the membranes in 
duplicate using a hand held 96 well replica plater (V&P Scientific, Inc.). The plates were 
incubated overnight at 37 C. The membranes were processed according to the manufacturer's 
recommendations. 

The cDNAs that needed to be mapped by hybridization were PCR amplified using the 
relevant primer (oligos 1, 3, 5, 7 and 11) that would amplify that clone. For this PCR 
amplification, the primers were modified to contain a linkered digoxigenin molecule at the 5' 
end of the oligonucleotide. The PCR amplification was performed under the same conditions as 
described in Preparation of cDNA Pools (above). The PCR products were evaluated for 
quality and quantity by electrophoresis on a 1% agarose gel by loading 5 ul of the PCR 
reaction. The nylon membranes containing the stamped BACs were individually pre- 
hybridized in 50 ml conical tubes containing 10 ml of hybridization solution (5x SSPE, 0.5x 
Blotto, 2.5% SDS and 1 mM EDTA (pH 8.0)). The 50 ml conical tubes were placed in a 
rotisserie oven (Robbins Scientific) for 2 hours at 65 C. 25 ng of each cDNA probe was 
denatured and added into individual 50 ml conical tubes containing the nylon membrane and 
hybridization solution. The hybridization was performed overnight at 65 C. The filters were 
washed for 20 minutes at 65 C in each of the following solutions: 3x SSPE, 0.1% SDS; 1x 
SSPE, 0.1% SDS and 0.1x SSPE, 0.1% SDS. 

The membranes were removed from the 50 ml conical tubes and placed in a dish. 
Acetate sheets were placed between each membrane to prevent them from sticking to each 
other. The incubation of the membranes with the Anti-DIG-AP and CDP-Star was performed 
according to the manufacturer's recommendations (Boehringer Mannheim). The membranes were 
wrapped in Saran wrap and exposed to Kodak Bio-Max X-ray film for 1 hour. 
X. cDNA Cloning and Expression Analysis 

To characterize the expression of the genes identified by direct cDNA selection and 
genomic DNA sequencing in comparison to the publicly available databases, a series of 
experiments were performed to further characterize the genes in the HBM region. First, 
oligonucleotide primers were designed for use in the polymerase chain reaction (PCR) so that 
portions of a cDNA, EST, or genomic DNA could be amplified from a pool of DNA 
molecules (a cDNA library) or RNA population (RT-PCR and RACE). The PCR primers 
were used in a reaction containing genomic DNA to verify that they generated a product of 
the size predicted based on the genomic (BAC) sequence. A number of cDNA libraries were 
then examined for the presence of the specific cDNA or EST. The presence of a fragment of 
a transcription unit in a particular cDNA library indicates a high probability that additional 
portions of the same transcription unit will be present as well. 

A critical piece of data that is required when characterizing novel genes is the length, 
in nucleotides, of the processed transcript or messenger RNA (mRNA). One skilled in the art 
primarily determines the length of an mRNA by Northern blot hybridization (Sambrook et al, 
Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold Spring 
Harbor NY (1989)). Groups of ESTs and direct-selected cDNA clones that displayed 
significant sequence similarity to sequenced BACs in the critical region were grouped for 
convenience into approximately 30 kilobase units. Within each 30 kilobase unit there were 
from one up to fifty ESTs and direct-selected cDNA clones which comprised one or more 
independent transcription units. One or more ESTs or direct-selected cDNAs were used as 
hybridization probes to determine the length of the mRNA in a variety of tissues, using 
commercially available reagents (Multiple Tissue Northern blot; Clontech, Palo Alto, 
California) under conditions recommended by the manufacturer. 

Directionally cloned cDNA libraries from femoral bone and calvarial bone 
tissue were constructed by methods familiar to one skilled in the art (for example, Soares in 
Automated DNA Sequencing and Analysis, Adams, Fields and Venter, Eds., Academic Press, 
NY, pages 110-114 (1994)). Bones were initially broken into fragments with a hammer, and 
the small pieces were frozen in liquid nitrogen and reduced to a powder in a tissue pulverizer 
(Spectrum Laboratory Products). RNA was extracted from the powdered bone by 
homogenizing it with a standard Acid Guanidinium Thiocyanate-Phenol-Chloroform extraction 
buffer (e.g. Chomczynski and Sacchi, Anal. Biochem., 162:156-159 (1987)) using a polytron 
homogenizer (Brinkman Instruments). Additionally, human brain and lung total RNA was 
purchased from Clontech. PolyA RNA was isolated from total RNA using dynabeads-dT 
according to the manufacturer's recommendations (Dynal, Inc.). 

First strand cDNA synthesis was initiated using an oligonucleotide primer with the 
sequence: 5'-AACTGGAAGAATTCGCGGCCGCAGGAATTTTTTTTTTTTTTTTTT-3' 
(SEQ ID NO:27). This primer introduces a NotI restriction site (GCGGCCGC) 
at the 3' end of the cDNA. First and second strand synthesis were performed using the 
"one-tube" cDNA synthesis kit as described by the manufacturer (Life Technologies, Bethesda, 
MD). Double stranded cDNAs were treated with T4 polynucleotide kinase to ensure that the 
ends of the molecules were blunt (Soares in Automated DNA Sequencing and Analysis, 
Adams, Fields and Venter, Eds., Academic Press, NY, pages 110-114 (1994)), and the blunt 
ended cDNAs were then size selected by a Biogel column (Huynh et al in DNA Cloning, Vol. 
1, Glover, Ed., IRL Press, Oxford, pages 49-78 (1985)) or with a size-sep 400 sepharose 
column (Pharmacia, catalog # 27-5105-01). Only cDNAs of 400 base pairs or longer were 
used in subsequent steps. EcoRI adapters (sequence: 5' OH-AATTCGGCACGAG-OH 3' 
(SEQ ID NO:28), and 5' p-CTCGTGCCG-OH 3' (SEQ ID NO:29)) were then ligated to the 
double stranded cDNAs by methods familiar to one skilled in the art (Soares, 1994). The 
EcoRI adapters were then removed from the 3' end of the cDNA by digestion with NotI 
(Soares, 1994). The cDNA was then ligated into the plasmid vector pBluescript II KS+ 
(Stratagene, La Jolla, California), and the ligated material was transformed into E. coli host 
DH10B or DH12S by electroporation methods familiar to one skilled in the art (Soares, 
1994). After growth overnight at 37 C, DNA was recovered from the E. coli colonies after 
scraping the plates by processing as directed for the Mega-prep kit (Qiagen, Chatsworth, 
California). The quality of the cDNA libraries was estimated by counting a portion of the 
total numbers of primary transformants and determining the average insert size and the 
percentage of plasmids with no cDNA insert. Additional cDNA libraries (human total brain, 
heart, kidney, leukocyte, and fetal brain) were purchased from Life Technologies, Bethesda, 
MD. 

cDNA libraries, both oligo (dT) and random hexamer (N6) primed, were used for 
isolating cDNA clones transcribed within the HBM region: human bone, human brain, human 
kidney and human skeletal muscle (all cDNA libraries were made by the inventors, except for 
skeletal muscle (dT) and kidney (dT) cDNA libraries). Four 10 x 10 arrays of each of the 
cDNA libraries were prepared as follows: the cDNA libraries were titered to 2.5 x 10^6 using 
primary transformants. The appropriate volume of frozen stock was used to inoculate 2 L of 
LB/ampicillin (100 mg/ml). This inoculated liquid culture was aliquotted into 400 tubes of 4 
ml each. Each tube contained approximately 5000 cfu. The tubes were incubated at 30 C 
overnight with gentle agitation. The cultures were grown to an OD of 0.7-0.9. Frozen stocks 
were prepared for each of the cultures by aliquotting 100 ul of culture and 300 ul of 80% 
glycerol. Stocks were frozen in a dry ice/ethanol bath and stored at -70 C. DNA was prepared 
from the remaining culture using the Qiagen (Chatsworth, CA) spin miniprep kit according to 
the manufacturer's instructions. The DNAs from the 400 cultures were pooled to make 80 
column and row pools. The cDNA libraries were determined to contain HBM cDNA clones 
of interest by PCR. Markers were designed to amplify putative exons. Once a standard PCR 
optimization was performed and specific cDNA libraries were determined to contain cDNA 
clones of interest, the markers were used to screen the arrayed library. Positive addresses 
indicating the presence of cDNA clones were confirmed by a second PCR using the same 
markers. 
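
The row and column pooling described above (400 cultures in four 10 x 10 arrays, giving 80 pools) lets a positive clone be located from two PCR results rather than 400. The following sketch illustrates the bookkeeping only; the address scheme (array, row, column) and all names are hypothetical and are not taken from the original protocol.

```python
from itertools import product

# Sketch: build the 80 row/column pools from four 10 x 10 arrays and
# decode positive pools back to candidate tube addresses.
# Address format (array, row, column) is an illustrative ASSUMPTION.

ARRAYS = 4   # arrays per cDNA library
SIZE = 10    # each array is 10 rows x 10 columns


def build_pools():
    """Map each pool key to the tube addresses it contains."""
    pools = {}
    for a in range(ARRAYS):
        for i in range(SIZE):
            pools[(a, "row", i)] = [(a, i, c) for c in range(SIZE)]
            pools[(a, "col", i)] = [(a, r, i) for r in range(SIZE)]
    return pools


def candidate_addresses(positive_pools):
    """Tubes lying at the intersection of positive row and column pools."""
    hits = []
    for a in range(ARRAYS):
        rows = sorted(i for (arr, kind, i) in positive_pools
                      if arr == a and kind == "row")
        cols = sorted(i for (arr, kind, i) in positive_pools
                      if arr == a and kind == "col")
        hits.extend((a, r, c) for r, c in product(rows, cols))
    return hits


pools = build_pools()
single_hit = candidate_addresses({(0, "row", 3), (0, "col", 7)})
```

When more than one row and column pool is positive within an array, the intersections are ambiguous, which is one reason a confirming second PCR on the individual addresses, as described in the text, is useful.
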

Once a cDNA library was identified as likely to contain cDNA clones corresponding 
to a specific transcript of interest from the HBM region, it was manipulated to isolate the 
clone or clones containing cDNA inserts identical to the EST or direct-selected cDNA of 
interest. This was accomplished by a modification of the standard "colony screening" method 
(Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, 
Cold Spring Harbor NY (1989)). Specifically, twenty 150 mm LB+ampicillin agar plates 
were spread with 20,000 colony forming units (cfu) of cDNA library and the colonies allowed 
to grow overnight at 37 C. Colonies were transferred to nylon filters (Hybond from 
Amersham, or equivalent) and duplicates prepared by pressing two filters together essentially 
as described (Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor 
Laboratory, Cold Spring Harbor NY (1989)). The "master" plate was then incubated an 
additional 6-8 hours to allow the colonies to grow back. The DNA from the bacterial 
colonies was then affixed to the nylon filters by treating the filters sequentially with 
denaturing solution (0.5 N NaOH, 1.5 M NaCl) for two minutes, then neutralization solution (0.5 
M Tris-Cl pH 8.0, 1.5 M NaCl) for two minutes (twice). The bacterial colonies were 
removed from the filters by washing in a solution of 2X SSC/0.1% SDS for one minute while 
rubbing with tissue paper. The filters were air dried and baked under vacuum at 80 C for 1-2 
hours. 

A cDNA hybridization probe was prepared by random hexamer labeling (Feinberg 
and Vogelstein, Anal. Biochem., 132:6-13 (1983)) or by including gene-specific primers and 
no random hexamers in the reaction (for small fragments). Specific activity was calculated 
and was >5 x 10^8 cpm/10^8 ug of cDNA. The colony membranes were then prewashed in 10 
mM Tris-Cl pH 8.0, 1 M NaCl, 1 mM EDTA, 0.1% SDS for 30 minutes at 55 C. Following 
the prewash, the filters were prehybridized in >2 ml/filter of 6X SSC, 50% deionized 
formamide, 2% SDS, 5X Denhardt's solution, and 100 mg/ml denatured salmon sperm DNA, 
at 42 C for 30 minutes. The filters were then transferred to hybridization solution (6X SSC, 
2% SDS, 5X Denhardt's, 100 mg/ml denatured salmon sperm DNA) containing denatured 
32P-dCTP-labelled cDNA probe and incubated at 42 C for 16-18 hours. 

After the 16-18 hour incubation, the filters were washed under constant agitation in 
2X SSC, 2% SDS at room temperature for 20 minutes, followed by two washes at 65 C for 15 
minutes each. A second wash was performed in 0.5 X SSC, 0.5% SDS for 15 minutes at 
65 C. Filters were then wrapped in plastic wrap and exposed to radiographic film for several 
hours to overnight. After film development, individual colonies on plates were aligned with 
the autoradiograph so that they could be picked into a 1 ml solution of LB Broth containing 
ampicillin. After shaking at 37 C for 1-2 hours, aliquots of the solution were plated on 150 
mm plates for secondary screening. Secondary screening was identical to primary screening 
(above) except that it was performed on plates containing ~250 colonies so that individual 
colonies could be clearly identified for picking. 

After colony screening with radiolabeled probes yielded cDNA clones, the clones 
were characterized by restriction endonuclease cleavage, PCR, and direct sequencing to 
confirm the sequence identity between the original probe and the isolated clone. To obtain 
the full-length cDNA, the novel sequence from the end of the clone identified was used to 
probe the library again. This process was repeated until the length of the cDNA cloned 
matched that estimated to be full-length by the Northern blot analysis. 

RT-PCR was used as another method to isolate full length clones. The cDNA was 
synthesized and amplified using a "Superscript One Step RT-PCR" kit (Life Technologies, 
Gaithersburg, MD). The procedure involved adding 1.5 ug of RNA to the following: 25 ul of 
the reaction mix provided (a proprietary buffer mix with MgSO4 and dNTPs), 1 ul sense 
primer (10 uM), 1 ul anti-sense primer (10 uM), 1 ul of the reverse transcriptase and Taq DNA 
polymerase mix provided, and autoclaved water to a total reaction volume of 50 ul. The reaction 
was then placed in a thermocycler for 1 cycle at 50 C for 15 to 30 minutes, then 94 C for 15 
seconds, 55-60 C for 30 seconds and 68-72 C for 1 minute per kilobase of anticipated product 
and finally 1 cycle of 72 C for 5-10 minutes. The sample was analyzed on an agarose gel. 
The product was excised from the gel and purified (GeneClean, Bio 101). The 
purified product was cloned in pCTNR (General Contractor DNA Cloning System, 5 Prime - 
3 Prime, Inc.) and sequenced to verify that the clone was specific to the gene of interest. 
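
The RT-PCR set-up above reduces to two small calculations: the water needed to bring the reaction to 50 ul, and the extension time at 1 minute per kilobase. The sketch below is illustrative; the volume occupied by the 1.5 ug of RNA depends on the stock concentration, which the text does not give, so it is passed in as a hypothetical parameter.

```python
# Sketch: water volume and extension time for the one-step RT-PCR above.
# The RNA volume is an ASSUMPTION supplied by the user, not from the text.

REACTION_MIX_UL = 25.0     # 2x reaction mix provided with the kit
SENSE_PRIMER_UL = 1.0      # 10 uM stock
ANTISENSE_PRIMER_UL = 1.0  # 10 uM stock
ENZYME_MIX_UL = 1.0        # reverse transcriptase / Taq mix
TOTAL_UL = 50.0


def water_volume_ul(rna_ul: float) -> float:
    """Autoclaved water needed to reach the 50 ul total volume."""
    fixed = (REACTION_MIX_UL + SENSE_PRIMER_UL
             + ANTISENSE_PRIMER_UL + ENZYME_MIX_UL)
    water = TOTAL_UL - fixed - rna_ul
    if water < 0:
        raise ValueError("RNA volume too large for a 50 ul reaction")
    return water


def extension_seconds(product_bp: int) -> float:
    """Extension time at 1 minute per kilobase of anticipated product."""
    return 60.0 * product_bp / 1000.0
```

For instance, if the 1.5 ug of RNA were delivered in 2 ul, 20 ul of water would complete the reaction, and a 2.5 kb product would call for a 150 second extension step.
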

Rapid Amplification of cDNA ends (RACE) was performed following the 
manufacturer's instructions using a Marathon cDNA Amplification Kit (Clontech, Palo Alto, 
CA) as a method for cloning the 5' and 3' ends of candidate genes. cDNA pools were 
prepared from total RNA by performing first strand synthesis, where a sample of total RNA 
was mixed with a modified oligo (dT) primer, heated to 70 C and cooled on ice, 
followed by the addition of 5x first strand buffer, 10 mM dNTP mix, and AMV Reverse 
Transcriptase (20 U/ul). The tube was incubated at 42 C for one hour and then the reaction 
tube was placed on ice. For second strand synthesis, the following components were added 
directly to the reaction tube: 5x second strand buffer, 10 mM dNTP mix, sterile water, 20x 
second strand enzyme cocktail and the reaction tube was incubated at 16 C for 1.5 hours. T4 
DNA Polymerase was added to the reaction tube and incubated at 16 C for 45 minutes. The 
second-strand synthesis was terminated with the addition of an EDTA/Glycogen mix. The 
sample was subjected to a phenol/chloroform extraction and an ammonium acetate 
precipitation. The cDNA pools were checked for quality by analyzing on an agarose gel for 
size distribution. Marathon cDNA adapters (Clontech) were then ligated onto the cDNA 
ends. The specific adapters contained priming sites that allowed for amplification of either 5' 
or 3' ends, depending on the orientation of the gene specific primer (GSP) that was chosen. 
An aliquot of the double stranded cDNA was added to the following reagents: 10 uM 
Marathon cDNA adapter, 5x DNA ligation buffer, T4 DNA ligase. The reaction was 
incubated at 16 C overnight. The reaction was heat inactivated to terminate the reaction. PCR 
was performed by the addition of the following to the diluted double stranded cDNA pool: 
lOx cDNA PCR reaction buffer, 10 uM dNTP mix, 10 uM GSP, 10 uM API primer (kit), 
50x Advantage cDNA Polymerase Mix. Thermal Cycling conditions were 94 C for 30 
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seconds, 5 cycles of 94 C for 5 seconds, 72 C for 4 minutes, 5 cycles of 94 C for 5 seconds, 
70 C for 4 minutes, 23 cycles of 94 C for 5 seconds, 68 C for 4 minutes. After the first round 
of PCR was performed using the GSP to extend to the end of the adapter to create the adapter 
primer binding site, exponential amplification of the specific cDNA of interest was observed, 
5 Usually a second nested PCR is performed to confirm the specific cDNA. The RACE 
product was analyzed on an agarose gel and then excised and purified from the gel 
(GeneClean, BIO 101). The RACE product was then cloned into pCTNR (General 
Contractor DNA Cloning System, 5' - 3\ Inc.) and the DNA sequence determined to verify 
that the clone is specific to the gene of interest. 
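
The step-down cycling program described above can be written out as data, which makes the total cycle count and the minimum run time easy to check. The following is only a minimal sketch: the names RACE_PROGRAM, amplification_cycles and total_hold_seconds are illustrative and not part of the kit protocol, and ramp times between temperatures are ignored.

```python
# Step-down RACE cycling program as stated in the text.
# Each stage is (number of cycles, [(temperature in deg C, hold in seconds), ...]).
# Assumption: only hold times are counted; instrument ramp times are ignored.
RACE_PROGRAM = [
    (1, [(94, 30)]),               # initial denaturation
    (5, [(94, 5), (72, 240)]),     # 5 cycles, 72 C anneal/extend
    (5, [(94, 5), (70, 240)]),     # 5 cycles, 70 C anneal/extend
    (23, [(94, 5), (68, 240)]),    # 23 cycles, 68 C anneal/extend
]


def amplification_cycles(program):
    """Count cycles in the two-step stages (the single-step denaturation is excluded)."""
    return sum(cycles for cycles, steps in program if len(steps) > 1)


def total_hold_seconds(program):
    """Sum of all programmed hold times, in seconds."""
    return sum(cycles * sum(seconds for _, seconds in steps)
               for cycles, steps in program)


print(amplification_cycles(RACE_PROGRAM), "amplification cycles")
print(round(total_hold_seconds(RACE_PROGRAM) / 60, 1), "minutes of programmed holds")
```

Lowering the annealing/extension temperature in stages (72, 70, then 68°C) is what gives the early cycles their higher stringency.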

XI. Mutation Analysis

Comparative genes were identified using the above procedures and the exons from 
each gene were subjected to mutation detection analysis. Comparative DNA sequencing was 
used to identify polymorphisms in HBM candidate genes from chromosome 11q12-13. DNA
sequences for candidate genes were amplified from patient lymphoblastoid cell lines. 

The inventors developed a method based on analysis of direct DNA sequencing of
PCR products amplified from candidate regions to search for the causative polymorphism.
The procedure consisted of three stages that used different subsets of the HBM family to find
segregating polymorphisms and a population panel to assess the frequency of the
polymorphisms. The family resources result from a single founder, leading to the assumption
that all affected individuals will share the same causative polymorphism.

Candidate regions were first screened in a subset of the HBM family consisting of the
proband (the daughter) and her mother, father and brother. Monochromosomal reference
sequences were produced concurrently and used for comparison. The mother and daughter
carried the HBM polymorphism in this nuclear family, providing the ability to monitor
polymorphism transmission. The net result is that two HBM chromosomes and six non-HBM
chromosomes were screened. This allowed exclusion of numerous frequent alleles. Only
alleles exclusively present in the affected individuals passed to the next level of analysis.

Polymorphisms that segregated exclusively with the HBM phenotype in this original
family were then re-examined in an extended portion of the HBM pedigree consisting of two
additional nuclear families. These families consisted of five HBM and three unaffected
individuals. The HBM individuals in this group included the two critical crossover
individuals, providing the centromeric and telomeric boundaries of the critical region.
Tracking the heredity of polymorphisms between these individuals and their affected parents
allowed for further refining of the critical region. This group brought the total of HBM
chromosomes screened to seven and the total of non-HBM chromosomes to seventeen.
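
The chromosome totals quoted for the first two screening stages follow from simple counting. The sketch below assumes, consistent with the heterozygous state reported for affected family members, that each affected individual carries exactly one HBM chromosome and each unaffected individual carries none. The function name chromosome_counts is illustrative only.

```python
def chromosome_counts(n_affected, n_unaffected):
    """Return (HBM chromosomes, non-HBM chromosomes) for a group of diploid individuals.

    Assumption: every affected individual is heterozygous (one HBM chromosome,
    one non-HBM chromosome) and every unaffected individual has two non-HBM
    chromosomes.
    """
    hbm = n_affected
    non_hbm = n_affected + 2 * n_unaffected
    return hbm, non_hbm


# Stage 1: nuclear family (mother and daughter affected, father and brother unaffected).
stage1 = chromosome_counts(n_affected=2, n_unaffected=2)
# Stage 2: two additional nuclear families (five HBM, three unaffected individuals).
stage2 = chromosome_counts(n_affected=5, n_unaffected=3)

total_hbm = stage1[0] + stage2[0]
total_non_hbm = stage1[1] + stage2[1]
print(stage1, stage2, (total_hbm, total_non_hbm))
```

Under this assumption the first stage gives two HBM and six non-HBM chromosomes, and the running totals after the second stage are seven and seventeen, matching the text.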

When a given polymorphism continued to segregate exclusively with the HBM
phenotype in the extended group, a population panel was then examined. This panel of 84
persons consisted of 42 individuals known to have normal bone mineral density and 42
individuals known to be unrelated but with untyped bone mineral density. Normal bone
mineral density is within two standard deviations of BMD Z score 0. The second group was
from the widely used CEPH panel of individuals. Any segregating polymorphisms found to
be rare in this population were subsequently examined on the entire HBM pedigree and a
larger population.

Polymerase chain reaction (PCR) was used to generate sequencing templates from the
HBM family's DNA and monochromosomal controls. Enzymatic amplification of genes
within the HBM region on 11q12-13 was accomplished using the PCR with oligonucleotides
flanking each exon as well as the putative 5' regulatory elements of each gene. The primers
were chosen to amplify each exon as well as 15 or more base pairs within each intron on
either side of the splice. All PCR primers were made as chimeras to facilitate dye primer
sequencing. The M13-21F (5'- GTA CGA CGG CCA GT -3') (SEQ ID NO:30) and -28REV
(5'- AAC AGC TAT GAC CAT G -3') (SEQ ID NO:31) primer binding sites were built on to
the 5' end of each forward and reverse PCR primer, respectively, during synthesis. 150 ng of
genomic DNA was used in a 50 ul PCR with 2 U AmpliTaq, 500 nM primer and 125 uM
dNTP. Buffer and cycling conditions were specific to each primer set. TaqStart antibody
(Clontech) was used for hot start PCR to minimize primer dimer formation. 10% of the
product was examined on an agarose gel. The appropriate samples were diluted 1:25 with
deionized water before sequencing.
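
The chimeric primer design described above amounts to placing the universal sequencing-primer site at the 5' end of each gene-specific primer. The following minimal sketch uses the two tail sequences exactly as printed in the text; the function name make_chimeric_primers and the gene-specific sequences in the example are hypothetical placeholders, not primers from this work.

```python
# Universal tails as printed in the text (SEQ ID NO:30 and SEQ ID NO:31).
M13_FORWARD_TAIL = "GTACGACGGCCAGT"     # M13 -21F binding site
M13_REVERSE_TAIL = "AACAGCTATGACCATG"   # -28REV binding site


def _clean(seq):
    """Remove spacing and normalize case in a primer string."""
    return seq.replace(" ", "").upper()


def make_chimeric_primers(forward_gsp, reverse_gsp):
    """Prepend the universal tails to the 5' end of each gene-specific primer."""
    return (M13_FORWARD_TAIL + _clean(forward_gsp),
            M13_REVERSE_TAIL + _clean(reverse_gsp))


# Hypothetical gene-specific primers, for illustration only.
fwd, rev = make_chimeric_primers("acg tac gta cgt", "TTGACCGGTTAA")
print(fwd)
print(rev)
```

Because every amplicon then carries the same two tails, a single pair of dye-labeled sequencing primers can be used for all exons.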

Each PCR product was sequenced according to the standard Energy Transfer primer
(Amersham) protocol. All reactions took place in 96 well trays. Four separate reactions, one
each for A, C, G and T, were performed for each template. Each reaction included 2 ul of the
sequencing reaction mix and 3 ul of diluted template. The plates were then heat sealed with
foil tape and placed in a thermal cycler and cycled according to the manufacturer's
recommendation. After cycling, the 4 reactions were pooled, 3 ul of the pooled product was
transferred to a new 96 well plate and 1 ul of the manufacturer's loading dye was added to
each well. All 96 well pipetting procedures occurred on a Hydra 96 pipetting station
(Robbins Scientific, USA). 1 ul of pooled material was directly loaded onto a 48 lane gel
running on an ABI 377 DNA sequencer for a 10 hour, 2.4 kV run.

Polyphred (University of Washington) was used to assemble sequence sets for
viewing with Consed (University of Washington). Sequences were assembled in groups
representing all relevant family members and controls for a specified target region. This was
done separately for each of the three stages. Forward and reverse reads were included for
each individual along with reads from the monochromosomal templates and a color annotated
reference sequence. Polyphred indicated potential polymorphic sites with a purple flag. Two
readers independently viewed each assembly and assessed the validity of the purple-flagged
sites.

A total of 23 exons present in the mature mRNA and several other portions of the
primary transcript were evaluated for heterozygosity in the nuclear family of two HBM-
affected and two unaffected individuals. 25 SNPs were identified, as shown in the table
below.

TABLE 4: Single Nucleotide Polymorphisms in the Zmax1 Gene and Environs

Exon Name                       Location        Base Change
b200e21-h_Contig1_1.nt          69169 (309G)    C/A
b200e21-h_Contig4_12.nt         27402 (309G)    A/G
b200e21-h_Contig4_13.nt         27841 (309G)    T/C
b200e21-h_Contig4_16.nt         35600 (309G)    A/G
b200e21-h_Contig4_21.nt         45619 (309G)    G/A
b200e21-h_Contig4_22.nt-a       46018 (309G)    T/G
b200e21-h_Contig4_22.nt-b       46093 (309G)    T/G
b200e21-h_Contig4_22.nt-c       46190 (309G)    A/G
b200e21-h_Contig4_24.nt-a       50993 (309G)    T/C
b200e21-h_Contig4_24.nt-b       51124 (309G)    C/T
b200e21-h_Contig4_25.nt         55461 (309G)    C/T
b200e21-h_Contig4_33.nt-a       63645 (309G)    C/A
b200e21-h_Contig4_33.nt-b       63646 (309G)    A/C
b200e21-h_Contig4_61.nt         24809 (309G)    T/G
b200e21-h_Contig4_62.nt         27837 (309G)    T/C
b200e21-h_Contig4_63.nt-a       31485 (309G)    C/T
b200e21-h_Contig4_63.nt-b       31683 (309G)    A/G
b200e21-h_Contig4_9.nt          24808 (309G)    T/G
b527d12-h_Contig030g_1.nt-a     31340 (308G)    T/C
b527d12-h_Contig030g_1.nt-b     32538 (308G)    A/G
b527d12-h_Contig080C_2.nt       13224 (308G)    A/G
b527d12-h_Contig087C_1.nt       21119 (308G)    C/A
b527d12-h_Contig087C_4.nt       30497 (308G)    G/A
b527d12-h_Contig088C_4.nt       24811 (309G)    A/C
b527d12-h_Contig089_1HP.nt      68280 (309G)    G/A

In addition to the polymorphisms presented in Table 4, two additional
polymorphisms can also be present in SEQ ID NO:2. The first is a change at position 2002 of
SEQ ID NO:2. Either a guanine or an adenine can appear at this position. This
polymorphism is silent and is not associated with any change in the amino acid sequence.
The second change is at position 4059 of SEQ ID NO:2, corresponding to a cytosine (C) to
thymine (T) change. This polymorphism results in a corresponding amino acid change from
a valine (V) to an alanine (A). Other polymorphisms were found in the candidate gene exons
and adjacent intron sequences. Any one or combination of the polymorphisms listed in Table
4 or the two discussed above could also have a minor effect on bone mass when present in
SEQ ID NO:2.

The present invention encompasses the nucleic acid sequences having the nucleic acid
sequence of SEQ ID NO:1 with the above-identified point mutations.
Preferably, the present invention encompasses the nucleic acid of SEQ ID NO:2.
Specifically, a base-pair substitution changing G to T at position 582 in the coding sequence
of Zmax1 (the HBM gene) was identified as heterozygous in all HBM individuals, and not
found in the unaffected individuals (i.e., b527d12-h_Contig087C_1.nt). Fig. 5 shows the
order of the contigs in B527D12. The direction of transcription for the HBM gene is from
left to right. The sequence of contig 308G of B527D12 is the reverse complement of the
coding region of the HBM gene. Therefore, the relative polymorphism in contig 308G shown
in Table 4 as a base change substitution of C to A is the complement of the G to T
substitution in the HBM gene. This mutation causes a substitution of glycine 171 with valine
(G171V).
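
The two relationships stated above can be checked with a short sketch: a C to A change read on the reverse-complement strand is a G to T change on the coding strand, and a G to T change at the second position of any glycine codon yields a valine codon. The codon sets below are taken from the standard genetic code; the function names are illustrative, and which of the four glycine codons actually occurs at residue 171 is not assumed here.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}

# Standard genetic code entries for glycine and valine.
GLYCINE_CODONS = {"GGT", "GGC", "GGA", "GGG"}
VALINE_CODONS = {"GTT", "GTC", "GTA", "GTG"}


def complement_change(change):
    """Convert a base change such as 'C/A' to the equivalent change on the opposite strand."""
    ref, alt = change.split("/")
    return f"{COMPLEMENT[ref]}/{COMPLEMENT[alt]}"


def substitute_second_base(codon, new_base):
    """Return the codon with its middle base replaced."""
    return codon[0] + new_base + codon[2]


coding_strand_change = complement_change("C/A")
mutated = {codon: substitute_second_base(codon, "T") for codon in GLYCINE_CODONS}

print(coding_strand_change)
for codon, new_codon in sorted(mutated.items()):
    print(codon, "->", new_codon, "(valine)" if new_codon in VALINE_CODONS else "")
```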

The HBM polymorphism was confirmed by examining the DNA sequence of different
groups of individuals. In all members of the HBM pedigree (38 individuals), the HBM
polymorphism was observed in the heterozygous form in affected (i.e., elevated bone mass)
individuals only (N=18). In unaffected relatives (N=20) (BMD Z < 2.0) the HBM
polymorphism was never observed. To determine whether this polymorphism was ever observed in
individuals outside of the HBM pedigree, 297 phenotyped individuals were characterized at
the site of the HBM polymorphism. None were heterozygous at this site. In
an unphenotyped control group, 1 of 42 individuals was observed to be heterozygous at
position 582. Since this individual is deceased, their bone mineral density could not be
obtained. These data show that the high bone mass phenotype observed in the kindred
is strongly correlated with the G to T polymorphism at
position 582 of Zmax1. Taken together, these results establish that the HBM polymorphism
genetically segregates with the HBM phenotype, and that both the HBM polymorphism and
phenotype are rare in the general population.
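
As a rough illustration of how strong the pedigree association is, the counts given above (18 of 18 affected carriers, 0 of 20 unaffected carriers) can be placed in a 2x2 table and evaluated with a one-sided Fisher's exact test. This calculation is not part of the original analysis, and it assumes the individuals are independent observations, which relatives in a pedigree are not; it is a sketch only. The function name fisher_one_sided is illustrative.

```python
from math import comb


def fisher_one_sided(a, b, c, d):
    """One-sided Fisher's exact test for the 2x2 table [[a, b], [c, d]].

    Returns the hypergeometric probability of observing a count of at least
    `a` in the top-left cell, with all row and column totals held fixed.
    """
    row1, col1, n = a + b, a + c, a + b + c + d
    upper = min(row1, col1)
    favorable = sum(comb(row1, k) * comb(n - row1, col1 - k)
                    for k in range(a, upper + 1))
    return favorable / comb(n, col1)


# Rows: affected, unaffected. Columns: heterozygous carrier, non-carrier.
p_value = fisher_one_sided(18, 0, 0, 20)
print(f"one-sided p = {p_value:.2e}")
```

With a perfectly segregating table the only arrangement as extreme as the observed one is the observed one itself, so the value is simply one over the number of ways to choose 18 carriers from 38 individuals.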
XII. Allele Specific Oligonucleotide (ASO) Analysis 

The amplicon containing the HBM1 polymorphism was PCR amplified using primers 




Atty. Dkt. No.: 032796-014 

specific for the exon of interest. The appropriate population of individuals was PCR
amplified in 96 well microtiter plates as follows. PCR reactions (20 ul) containing 1X
Promega PCR buffer (Cat. # M1883 containing 1.5 mM MgCl2), 100 uM dNTP, 200 nM PCR
primers (1863F: CCAAGTTCTGAGAAGTCC and 1864R: AATACCTGAAACCATACCTG),
1 U Amplitaq, and 20 ng of genomic DNA were prepared and amplified under the
following PCR conditions: 94°C, 1 minute; (94°C, 30 sec; 58°C, 30 sec; 72°C, 1 min) x 35
cycles; 72°C, 5 min; 4°C, hold. Loading dye was then added and 10 ul of the products was
electrophoresed on 1.5% agarose gels containing 1 ng/ml ethidium bromide at 100-150 V for
5-10 minutes. Gels were treated 20 minutes in denaturing solution (1.5 M NaCl, 0.5 N
NaOH), and rinsed briefly with water. Gels were then neutralized in 1 M Tris-HCl, pH 7.5,
1.5 M NaCl, for 20 minutes and rinsed with water. Gels were soaked in 10X SSC for 20
minutes and blotted onto nylon transfer membrane (Hybond N+, Amersham) in 10X SSC
overnight. Filters were then rinsed in 6X SSC for 10 minutes and UV crosslinked.

The allele specific oligonucleotides (ASO) were designed with the polymorphism
approximately in the middle. Oligonucleotides were phosphate free at the 5' end and were
purchased from Gibco BRL. Sequences of the oligonucleotides are:

2326 Zmax1.ASO.g: AGACTGGGGTGAGACGC

2327 Zmax1.ASO.t: CAGACTGGGTTGAGACGCC

The polymorphic nucleotides are underlined. To label the oligos, 1.5 ul of 1 ug/ul ASO
oligo (2326.Zmax1.ASO.g or 2327.Zmax1.ASO.t), 11 ul ddH2O, 2 ul 10X kinase forward
buffer, 5 ul gamma-32P-ATP (6000 Ci/mMole), and 1 ul T4 polynucleotide kinase (10 U/ul) were
mixed, and the reaction incubated at 37°C for 30-60 minutes. Reactions were then placed at
95°C for 2 minutes and 30 ul H2O was added. The probes were purified using a G25 microspin
column (Pharmacia).
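
The position of the polymorphic nucleotide within the two ASO probes can be recovered by aligning the shorter oligonucleotide against the longer one and listing the mismatches. The sketch below uses the two probe sequences as given above; the helper name best_alignment is illustrative, and positions are 0-based.

```python
ASO_G = "AGACTGGGGTGAGACGC"      # 2326 Zmax1.ASO.g (17 nt)
ASO_T = "CAGACTGGGTTGAGACGCC"    # 2327 Zmax1.ASO.t (19 nt)


def best_alignment(short, long_):
    """Slide `short` along `long_` without gaps; return (offset, mismatch positions in `short`)
    for the offset with the fewest mismatches."""
    best = None
    for offset in range(len(long_) - len(short) + 1):
        window = long_[offset:offset + len(short)]
        mismatches = [i for i, (x, y) in enumerate(zip(short, window)) if x != y]
        if best is None or len(mismatches) < len(best[1]):
            best = (offset, mismatches)
    return best


offset, mismatches = best_alignment(ASO_G, ASO_T)
site = mismatches[0]
print("offset of the 17-mer within the 19-mer:", offset)
print("polymorphic base (0-based) in ASO.g:", site, ASO_G[site])
print("polymorphic base (0-based) in ASO.t:", site + offset, ASO_T[site + offset])
```

The single mismatch falls at the central base of the 17-mer, consistent with the stated design of placing the polymorphism approximately in the middle of each probe.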

Blots were prehybridized in 10 ml 5X SSPE, 5X Denhardt's, 2% SDS, and 100 ng/ml
denatured, sonicated salmon sperm DNA at 40°C for 2 hr. The entire reaction mix of kinased
oligo was then added to 10 ml fresh hybridization buffer (5X SSPE, 5X Denhardt's, 2% SDS) and
hybridized at 40°C for at least 4 hours to overnight.

All washes were done in 5X SSPE, 0.1% SDS. The first wash was at 45°C for 15 minutes;
the solution was then changed and the filters were washed at 50°C for 15 minutes. Filters were then
exposed to Kodak Biomax film with 2 intensifying screens at -70°C for 15 minutes to 1 hr. If
necessary, the filters were washed at 55°C for 15 minutes and exposed to film again. Filters
were stripped by washing in boiling 0.1X SSC, 0.1% SDS for 10 minutes at least 3 times.
The two films that best captured the allele specific assay with the 2 ASOs were converted
into digital images by scanning them into Adobe PhotoShop. These images were overlaid
against each other in Graphic Converter and then scored and stored in FileMaker Pro 4.0 (see
Fig. 9).

XIII. Cellular Localization of Zmax1

A. Gene Expression in Rat Tibia by Non-isotopic In Situ Hybridization

In situ hybridization was conducted by Pathology Associates International (PAI),
Frederick, MD. This study was undertaken to determine the specific cell types that express
the Zmax1 gene in rat bone with particular emphasis on areas of bone growth and
remodeling. Zmax1 probes used in this study were generated from both human (HuZmax1)
and mouse (MsZmax1) cDNAs, which share an 87% sequence identity. The homology of
human and mouse Zmax1 with rat Zmax1 is unknown.
For example, gene expression by non-isotopic in situ hybridization was performed as
follows, but other methods would be known to the skilled artisan. Tibias were collected from
two 6 to 8 week old female Sprague Dawley rats euthanized by carbon dioxide asphyxiation.
Distal ends were removed and proximal tibias were snap frozen in OCT embedding medium
with liquid nitrogen immediately following death. Tissues were stored in a -80°C freezer.
Probe templates were prepared by PCR amplification from cDNA as follows. The
primers to amplify PCR products from a cDNA clone were chosen using published sequences
of both human LRP5 (Genbank Accession No. AB017498) and mouse LRP5 (Genbank
Accession No. AF064984). In order to minimize cross reactivity with other genes in the
LDL receptor family, the PCR products were derived from an intracellular portion of the
protein coding region. PCR was performed in a 50 ul reaction volume using a cDNA clone as
template. PCR reactions contained 1.5 mM MgCl2, 1 unit Amplitaq, 200 uM dNTPs and 2
uM each primer. PCR cycling conditions were 94°C for 1 min., followed by 35 cycles of
94°C for 30 seconds, 55°C for 30 seconds, 72°C for 30 seconds; followed by a 5 minute
extension at 72°C. The reactions were then run on a 1.5% agarose Tris-Acetate gel. DNA
was eluted from the agarose, ethanol precipitated and resuspended in 10 mM Tris, pH 8.0.
Gel purified PCR products were prepared for both mouse and human cDNAs and supplied to
Pathology Associates International for in situ hybridizations.

The sequences of the human and mouse PCR primers and products were as follows:
Human Zmax1 sense primer (HBM1253)

CCCGTGTGCTCCGCCGCCCAGTTC
Human Zmax1 antisense primer (HBM1465)

GGCTCACGGAGCTCATCATGGACTT
Human Zmax1 PCR product

CCCGTGTGCTCCGCCGCCCAGTTCCCCTGCGCGCGGGGTCAGTGTGTGGACCTGC 
GCCTGCGCTGCGACGGCGAGGCAGACTGTCAGGACCGCTCAGACGAGGTGGACT 

GTGACGCCATCTGCCTGCCCAACCAGTTCCGGTGTGCGAGCGGCCAGTGTGTCCT
CATCAAACAGCAGTGCGACTCCTTCCCCGACTGTATCGACGGCTCCGACGAGCTC 
ATGTGTGAAATCACCAAGCCGCCCTCAGACGACAGCCCGGCCCACAGCAGTGCC 
ATCGGGCCCGTCATTGGCATCATCCTCTCTCTCTTCGTCATGGGTGGTGTCTATTT 
TGTGTGCCAGCGCGTGGTGTGCCAGCGCTATGCGGGGGCCAACGGGCCCTTCCCG 

CACGAGTATGTCAGCGGGACCCCGCACGTGCCCCTCAATTTCATAGCCCCGGGCG
GTTCCCAGCATGGCCCCTTCACAGGCATCGCATGCGGAAAGTCCATGATGAGCTC
CGTGAGCC 

Mouse Zmax1 sense primer (HBM1655)
AGCGAGGCCACCATCCACAGG
Mouse Zmax1 antisense primer (HBM1656)
TCGCTGGTCGGCATAATCAAT
Mouse Zmax1 PCR product

AGCAGAGCCACCATCCACAGGATCTCCCTGGAGACTAACAACAACGATGTGGCT 

ATCCCACTCACGGGTGTCAAAGAGGCCTCTGCACTGGACTTTGATGTGTCCAACA 

ATCACATCTACTGGACTGATGTTAGCCTCAAGACGATCAGCCGAGCCTTCATGAA 

TGGGAGCTCAGTGGAGCACGTGATTGAGTTTGGCCTCGACTACCCTGAAGGAATG 

GCTGTGGACTGGATGGGCAAGAACCTCTATTGGGCGGACACAGGGACCAACAGG 

ATTGAGGTGGCCCGGCTGGATGGGCAGTTCCGGCAGGTGCTTGTGTGGAGAGAC 

CTTGACAACCCCAGGTCTCTGGCTCTGGATCCTACTAAAGGCTACATCTACTGGA 

CTGAGTGGGGTGGCAAGCCAAGGATTGTGCGGGCCTTCATGGATGGGACCAATT 

GTATGACACTGGTAGACAAGGTGGGCCGGGCCAACGACCTCACCATTGATTATG 

CCGACCAGCGA 
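
A PCR product should begin with the sense primer and end with the reverse complement of the antisense primer. The sketch below applies that check to the human primer pair listed above, using only the first and last bases of the human product as printed; the variable and function names are illustrative.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


HUMAN_SENSE = "CCCGTGTGCTCCGCCGCCCAGTTC"        # HBM1253
HUMAN_ANTISENSE = "GGCTCACGGAGCTCATCATGGACTT"   # HBM1465

# Terminal segments of the human PCR product, copied from the listing above.
PRODUCT_START = "CCCGTGTGCTCCGCCGCCCAGTTCCCCTGCGCG"
PRODUCT_END = "GGAAAGTCCATGATGAGCTCCGTGAGCC"

starts_with_sense = PRODUCT_START.startswith(HUMAN_SENSE)
ends_with_antisense_rc = PRODUCT_END.endswith(reverse_complement(HUMAN_ANTISENSE))

print("product starts with sense primer:", starts_with_sense)
print("product ends with antisense reverse complement:", ends_with_antisense_rc)
```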

Riboprobes were synthesized as follows. The PCR products were reamplified with
chimeric primers designed to incorporate either a T3 promoter upstream, or a T7 promoter
downstream of the reamplification products. The resulting PCR products were used as
template to synthesize digoxigenin-labeled riboprobes by in vitro transcription (IVT).
Antisense and sense riboprobes were synthesized using T7 and T3 RNA polymerases,
respectively, in the presence of digoxigenin-11-UTP (Boehringer-Mannheim) using a
MAXIscript IVT kit (Ambion) according to the manufacturer. The DNA was then degraded
with DNase I, and unincorporated digoxigenin was removed by ultrafiltration. Riboprobe
integrity was assessed by electrophoresis through a denaturing polyacrylamide gel.
Molecular size was compared with the electrophoretic mobility of a 100-1000 base pair (bp)
RNA ladder (Ambion). Probe yield and labeling were evaluated by blot immunochemistry.
Riboprobes were stored in 5 ul aliquots at -80°C.

The in situ hybridization was performed as follows. Frozen rat bone was cut into 5
um sections on a Jung CM3000 cryostat (Leica) and mounted on adhesive slides
(Instrumedics). Sections were kept in the cryostat at -20°C until all the slides were prepared
in order to prevent mRNA degradation prior to post-fixation for 15 minutes in 4%
paraformaldehyde. Following post-fixation, sections were incubated with 1 ng/ul of either
antisense or sense riboprobe in Pathology Associates International (PAI) customized
hybridization buffer for approximately 40 hours at 58°C. Following hybridization, slides
were subjected to a series of post-hybridization stringency washes to reduce nonspecific
probe binding. Hybridization was visualized by immunohistochemistry with an anti-digoxigenin
antibody (FAB fragment) conjugated to alkaline phosphatase. Nitroblue
tetrazolium chloride/bromochloroindolyl phosphate (Boehringer-Mannheim), a precipitating
alkaline phosphatase substrate, was used as the chromogen to stain hybridizing cells purple to
nearly black, depending on the degree of staining. Tissue sections were counter-stained with
nuclear fast red. Assay controls included omission of the probe and omission of both the probe
and the anti-digoxigenin antibody.

Specific cell types were assessed for demonstration of hybridization with antisense
probes by visualizing a purple to black cytoplasmic and/or peri-nuclear staining indicating a
positive hybridization signal for mRNA. Each cell type was compared to the replicate
sections, which were hybridized with the respective sense probe. Results were considered
positive if staining was observed with the antisense probe and no staining or weak
background with the sense probe.

The cellular localization of the hybridization signal for each of the study probes is
summarized in Table 5. Hybridization for Zmax1 was primarily detected in areas of bone
involved in remodeling, including the endosteum and trabecular bone within the metaphysis.
Hybridization in selected bone lining cells of the periosteum and epiphysis was also
observed. Positive signal was also noted in chondrocytes within the growth plate, particularly
in the proliferating chondrocytes. See Figs. 10, 11 and 12 for representative
photomicrographs of in situ hybridization results.

TABLE 5

Summary of Zmax1 in situ hybridization in rat tibia

PROBE       SITE                              ISH SIGNAL

Hu Zmax1    Epiphysis
              Osteoblasts                     +
              Osteoclasts
            Growth Plate
              resting chondrocytes            -
              proliferating chondrocytes      +
              hypertrophic chondrocytes       -
            Metaphysis
              osteoblasts                     +
              osteoclasts                     +
            Diaphysis
            Endosteum
              osteoblasts                     +
              osteoclasts                     +
            Periosteum

Ms Zmax1    Epiphysis
              Osteoblasts                     +
              Osteoclasts
            Growth Plate
              resting chondrocytes
              proliferating chondrocytes      +
              hypertrophic chondrocytes       +
            Metaphysis
              osteoblasts                     +
              osteoclasts                     +
            Diaphysis
            Endosteum
              osteoblasts                     +
              osteoclasts                     +
            Periosteum                        +

Legend: "+" = hybridization signal detected; "-" = no hybridization signal detected;
"ISH" = in situ hybridization



These studies confirm the positional expression of Zmax1 in cells involved in bone
remodeling and bone formation. Zmax1 expression in the zone of proliferation and in the
osteoblasts and osteoclasts of the proximal metaphysis suggests that the Zmax1 gene is
involved in the process of bone growth and mineralization. The activity and differentiation of
osteoblasts and osteoclasts are closely coordinated during development as bone is formed and
during growth as well as in adult life as bone undergoes continuous remodeling. The
formation of internal bone structures and bone remodeling result from the coupling of bone
resorption by activated osteoclasts with subsequent deposition of new material by osteoblasts.
Zmax1 is related to the LDL receptor gene, and thus may be a receptor involved in
mechanosensation and subsequent signaling in the process of bone remodeling. Therefore,
changes in the level of expression of this gene could impact on the rate of remodeling and
degree of mineralization of bone.
XIV. Antisense 

Antisense oligonucleotides are short synthetic nucleic acids that contain
complementary base sequences to a targeted RNA. Hybridization of the RNA in living cells
with the antisense oligonucleotide interferes with RNA function and ultimately blocks protein
expression. Therefore, any gene for which the partial sequence is known can be targeted by
an antisense oligonucleotide.

Antisense technology is becoming a widely used research tool and will play an
increasingly important role in the validation and elucidation of therapeutic targets identified
by genomic sequencing efforts.

Antisense technology was developed to inhibit gene expression by utilizing an
oligonucleotide complementary to the mRNA that encodes the target gene. There are several
possible mechanisms for the inhibitory effects of antisense oligonucleotides. Among them,
degradation of mRNA by RNase H is considered to be the major mechanism of inhibition of
protein function. This technique was originally used to elucidate the function of a target
gene, but may also have therapeutic applications, provided the oligonucleotide is designed
carefully and properly.

An example of materials and methods for antisense inhibition is
as follows. Preliminary studies have been undertaken in collaboration with
Sequitur (Natick, MA) using the antisense technology in the osteoblast-like murine cell line,
MC3T3. These cells can be triggered to develop along the bone differentiation sequence. An
initial proliferation period is characterized by minimal expression of differentiation markers
and initial synthesis of collagenous extracellular matrix. Collagen matrix synthesis is
required for subsequent induction of differentiation markers. Once the matrix synthesis
begins, osteoblast marker genes are activated in a clear temporal sequence: alkaline
phosphatase is induced at early times while bone sialoprotein and osteocalcin appear later in
the differentiation process. This temporal sequence of gene expression is useful in
monitoring the maturation and mineralization process. Matrix mineralization, which does not
begin until several days after maturation has started, involves deposition of mineral on and
within collagen fibrils deep within the matrix near the cell layer-culture plate interface. The
collagen fibril-associated mineral formed by cultured osteoblasts resembles that found in
woven bone in vivo and therefore is used frequently as a study reagent.

MC3T3 cells were transfected with antisense oligonucleotides for the first week of the
differentiation, according to the manufacturer's specifications (U.S. Patent No. 5,849,902).
The oligonucleotides designed for Zmax1 are given below:
10875: AGUACAGCUUCUUGCCAACCCAGUC
10876: UCCUCCAGGUCGAUGGUCAGCCCAU
10877: GUCUGAGUCCGAGUUCAAAUCCAGG

Figure 13 shows the results of antisense inhibition of Zmax1 in MC3T3 cells. The three
oligonucleotides shown above were transfected into MC3T3 and RNA was isolated according
to standard procedures. Northern analysis clearly shows markedly lower steady state levels
of the Zmax1 transcript while the control gene GAPDH remained unchanged. Thus,
antisense technology using the oligonucleotides described above allows for the study of the role of
Zmax1 expression in bone biology.
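
Because an antisense oligonucleotide is complementary to its target, the mRNA site that each oligonucleotide is expected to bind can be derived by taking its reverse complement. The following is a minimal sketch using the three sequences listed above; the names OLIGOS and target_site are illustrative, and no claim is made here about where these sites fall in the Zmax1 transcript.

```python
RNA_COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}

# Antisense oligonucleotides as listed in the text (written 5' to 3').
OLIGOS = {
    "10875": "AGUACAGCUUCUUGCCAACCCAGUC",
    "10876": "UCCUCCAGGUCGAUGGUCAGCCCAU",
    "10877": "GUCUGAGUCCGAGUUCAAAUCCAGG",
}


def target_site(antisense_rna):
    """Return the sense-strand RNA sequence (5' to 3') complementary to the oligonucleotide."""
    return "".join(RNA_COMPLEMENT[base] for base in reversed(antisense_rna))


targets = {name: target_site(seq) for name, seq in OLIGOS.items()}
for name, site in targets.items():
    print(name, "targets", site)
```

Applying target_site twice returns the original oligonucleotide, which is a convenient sanity check when transcribing sequences.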
The protein encoded by Zmax1 is related to the Low Density Lipoprotein receptor
(LDL receptor). See, Goldstein et al, Ann. Rev. Cell Biology, 1:1-39 (1985); Brown et al,
Science, 232:34-47 (1986). The LDL receptor is responsible for uptake of low density
lipoprotein, a lipid-protein aggregate that includes cholesterol. Individuals with a defect in
the LDL receptor are deficient in cholesterol removal and tend to develop atherosclerosis. In
addition, cells with a defective LDL receptor show increased production of cholesterol, in
part because of altered feedback regulation of cholesterol synthetic enzymes and in part
because of increased transcription of the genes for these enzymes. In some cell types,
cholesterol is a precursor for the formation of steroid hormones.

Thus, the LDL receptor may, directly or indirectly, function as a signal transduction
protein and may regulate gene expression. Because Zmax1 is related to the LDL receptor,
this protein may also be involved in signaling between cells in a way that affects bone
remodeling.

The glycine 171 amino acid is likely to be important for the function of Zmax1
because this amino acid is also found in the mouse homologue of Zmax1. The closely related
LRP6 protein also contains glycine at the corresponding position (Brown et al, Biochemical
and Biophysical Research Comm., 248:879-888 (1998)). Amino acids that are important in a
protein's structure or function tend to be conserved between species, because natural selection
prevents mutations with altered amino acids at important positions from becoming established.

In addition, the extracellular domain of Zmax1 contains four repeats consisting of five
YWT motifs followed by an EGF motif. This 5YWT+EGF repeat is likely to form a distinct
folded protein domain, as this repeat is also found in the LDL receptor and other LDL
receptor-related proteins. The first three 5YWT+EGF repeats are very similar in their
structure, while the fourth is highly divergent. Glycine 171 occurs in the central YWT motif
of the first 5YWT+EGF repeat in Zmax1. The other two similar 5YWT+EGF repeats of
Zmax1 also contain glycine at the corresponding position, as does the 5YWT+EGF repeat in
the LDL receptor protein. However, only 17.6% of the amino acids are identical among the
first three 5YWT+EGF repeats in Zmax1 and the single repeat in the LDL receptor. These
observations indicate that glycine 171 is essential to the function of this repeat, and mutation
of glycine 171 causes a functional alteration of Zmax1. The cDNA and peptide sequences are
shown in Figs. 6A-6E. The critical base at nucleotide position 582 is indicated in bold and is
underlined.

Northern blot analysis (Figs. 7A-B) reveals that Zmax1 is expressed in human bone
tissue as well as numerous other tissues. A multiple-tissue Northern blot (Clontech, Palo
Alto, CA) was probed with exons from Zmax1. As shown in Fig. 7A, the 5.5 kb Zmax1
transcript was highly expressed in heart, kidney, lung, liver and pancreas and was expressed at
lower levels in skeletal muscle and brain. A second northern blot, shown in Fig. 7B,
confirmed the transcript size at 5.5 kb, and indicated that Zmax1 is expressed in bone, bone
marrow, calvaria and human osteoblastic cell lines.

Taken together, these results indicate that the HBM polymorphism in the Zmax1 gene
is responsible for the HBM phenotype, and that the Zmax1 gene is important in bone
development. In addition, because mutation of Zmax1 can alter bone mineralization and
development, it is likely that molecules that bind to Zmax1 may usefully alter bone
development. Such molecules may include, for example, small molecules, proteins, RNA
aptamers, peptide aptamers, and the like.

XV. Preparation of Nucleic Acids, Vectors, Transformations and Host Cells 

Large amounts of the nucleic acids of the present invention may be produced by 
replication in a suitable host cell. Natural or synthetic nucleic acid fragments coding for a 
desired fragment will be incorporated into recombinant nucleic acid constructs, usually DNA 
constructs, capable of introduction into and replication in a prokaryotic or eukaryotic cell. 
Usually the nucleic acid constructs will be suitable for replication in a unicellular host, such 
as yeast or bacteria, but may also be intended for introduction into (with or without integration 
within the genome) cultured mammalian or plant or other eukaryotic cell lines. The 
purification of nucleic acids produced by the methods of the present invention is described, 
for example, in Sambrook et al, Molecular Cloning: A Laboratory Manual, 2nd Ed. (Cold 
Spring Harbor Laboratory, Cold Spring Harbor, NY (1989)) or Ausubel et al, Current 
Protocols in Molecular Biology, J. Wiley and Sons, NY (1992). 

The nucleic acids of the present invention may also be produced by chemical 
synthesis, e.g., by the phosphoramidite method described by Beaucage et al, Tetra. Letts., 
22:1859-1862 (1981) or the triester method according to Matteucci et al, J. Am. Chem. Soc., 
103:3185 (1981), and may be performed on commercial, automated oligonucleotide 
synthesizers. A double-stranded fragment may be obtained from the single-stranded product 
of chemical synthesis either by synthesizing the complementary strand and annealing the 
strands together under appropriate conditions or by adding the complementary strand using 
DNA polymerase with an appropriate primer sequence. 

Nucleic acid constructs prepared for introduction into a prokaryotic or eukaryotic host 
may comprise a replication system recognized by the host, including the intended nucleic acid 
fragment encoding the desired protein, and will preferably also include transcription and 
translational initiation regulatory sequences operably linked to the protein encoding segment. 
Expression vectors may include, for example, an origin of replication or autonomously 
replicating sequence (ARS) and expression control sequences, a promoter, an enhancer and 
necessary processing information sites, such as ribosome-binding sites, RNA splice sites, 
polyadenylation sites, transcriptional terminator sequences, and mRNA stabilizing sequences. 
Secretion signals may also be included where appropriate, whether from a native HBM or 
Zmax1 protein or from other receptors or from secreted proteins of the same or related 
species, which allow the protein to cross and/or lodge in cell membranes, and thus attain its 
functional topology, or be secreted from the cell. Such vectors may be prepared by means of 
standard recombinant techniques well known in the art and discussed, for example, in 
Sambrook et al, Molecular Cloning: A Laboratory Manual, 2nd Ed. (Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY (1989)) or Ausubel et al, Current Protocols in Molecular 
Biology, J. Wiley and Sons, NY (1992). 

An appropriate promoter and other necessary vector sequences will be selected so as 
to be functional in the host, and may include, when appropriate, those naturally associated 
with Zmax1 or HBM genes. Examples of workable combinations of cell lines and expression 
vectors are described in Sambrook et al, Molecular Cloning: A Laboratory Manual, 2nd Ed. 
(Cold Spring Harbor Laboratory, Cold Spring Harbor, NY (1989)) or Ausubel et al, Current 
Protocols in Molecular Biology, J. Wiley and Sons, NY (1992). Many useful vectors are 
known in the art and may be obtained from such vendors as Stratagene, New England 
BioLabs, Promega Biotech, and others. Promoters such as the trp, lac and phage promoters, 
tRNA promoters and glycolytic enzyme promoters may be used in prokaryotic hosts. Useful 
yeast promoters include promoter regions for metallothionein, 3-phosphoglycerate kinase or 
other glycolytic enzymes such as enolase or glyceraldehyde-3-phosphate dehydrogenase, 
enzymes responsible for maltose and galactose utilization, and others. Vectors and promoters 
suitable for use in yeast expression are further described in EP 73,675A. Appropriate 
non-native mammalian promoters might include the early and late promoters from SV40 (Fiers et 
al, Nature, 273:113 (1978)) or promoters derived from murine Moloney leukemia virus, 
mouse tumor virus, avian sarcoma viruses, adenovirus II, bovine papilloma virus or polyoma. 
In addition, the construct may be joined to an amplifiable gene (e.g., DHFR) so that multiple 
copies of the gene may be made. For appropriate enhancer and other expression control 
sequences, see also Enhancers and Eukaryotic Gene Expression, Cold Spring Harbor Press, 
Cold Spring Harbor, NY (1983). 

While such expression vectors may replicate autonomously, they may also replicate 
by being inserted into the genome of the host cell, by methods well known in the art. 

Expression and cloning vectors will likely contain a selectable marker, a gene 
encoding a protein necessary for survival or growth of a host cell transformed with the vector. 
The presence of this gene ensures growth of only those host cells which express the inserts. 
Typical selection genes encode proteins that a) confer resistance to antibiotics or other toxic 
substances, e.g., ampicillin, neomycin, methotrexate, etc.; b) complement auxotrophic 
deficiencies; or c) supply critical nutrients not available from complex media, e.g., the gene 
encoding D-alanine racemase for Bacilli. The choice of the proper selectable marker will 
depend on the host cell, and appropriate markers for different hosts are well known in the art. 

The vectors containing the nucleic acids of interest can be transcribed in vitro, and the 
resulting RNA introduced into the host cell by well-known methods, e.g., by injection (see, 
Kubo et al, FEBS Letts. 241:119 (1988)), or the vectors can be introduced directly into host 
cells by methods well known in the art, which vary depending on the type of cellular host, 
including electroporation; transfection employing calcium chloride, rubidium chloride, 
calcium phosphate, DEAE-dextran, or other substances; microprojectile bombardment; 
lipofection; infection (where the vector is an infectious agent, such as a retroviral genome); 
and other methods. See generally, Sambrook et al., 1989 and Ausubel et al., 1992. The 
introduction of the nucleic acids into the host cell by any method known in the art, including 
those described above, will be referred to herein as "transformation." The cells into which 
the nucleic acids described above have been introduced are meant to also include the 
progeny of such cells. 

Large quantities of the nucleic acids and proteins of the present invention may be 
prepared by expressing the Zmax1 or HBM nucleic acids or portions thereof in vectors or 
other expression vehicles in compatible prokaryotic or eukaryotic host cells. The most 
commonly used prokaryotic hosts are strains of Escherichia coli, although other prokaryotes, 
such as Bacillus subtilis or Pseudomonas may also be used. 

Mammalian or other eukaryotic host cells, such as those of yeast, filamentous fungi, 
plant, insect, or amphibian or avian species, may also be useful for production of the proteins 
of the present invention. Propagation of mammalian cells in culture is per se well known. 
See, Jakoby and Pastan (eds.), Cell Culture. Methods in Enzymology, volume 58, Academic 
Press, Inc., Harcourt Brace Jovanovich, NY (1979). Examples of commonly used 
mammalian host cell lines are VERO and HeLa cells, Chinese hamster ovary (CHO) cells, 
and WI38, BHK, and COS cell lines, although it will be appreciated by the skilled 
practitioner that other cell lines may be appropriate, e.g., to provide higher expression, 
desirable glycosylation patterns, or other features. 

Clones are selected by using markers depending on the mode of the vector 
construction. The marker may be on the same or a different DNA molecule, preferably the 
same DNA molecule. In prokaryotic hosts, the transformant may be selected, e.g., by 
resistance to ampicillin, tetracycline or other antibiotics. Production of a particular product 
based on temperature sensitivity may also serve as an appropriate marker. 

Prokaryotic or eukaryotic cells transformed with the nucleic acids of the present 
invention will be useful not only for the production of the nucleic acids and proteins of the 
present invention, but also, for example, in studying the characteristics of Zmax1 or HBM 
proteins. 

Antisense nucleic acid sequences are useful in preventing or diminishing the 
expression of Zmax1 or HBM, as will be appreciated by one skilled in the art. For example, 
nucleic acid vectors containing all or a portion of the Zmax1 or HBM gene or other sequences 
from the Zmax1 or HBM region may be placed under the control of a promoter in an 
antisense orientation and introduced into a cell. Expression of such an antisense construct 
within a cell will interfere with Zmax1 or HBM transcription and/or translation and/or 
replication. 
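
As an illustration of what "antisense orientation" means at the sequence level, the transcript produced from an insert cloned in reverse is the reverse complement of the sense (mRNA-like) strand. The sketch below is illustrative only; the function names and the six-base example are hypothetical and are not taken from the Zmax1 or HBM sequences.

```python
# Translation table for complementing unambiguous DNA bases.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the reverse complement of a DNA string (A, C, G, T only)."""
    return dna.translate(_COMPLEMENT)[::-1]


def antisense_rna(sense_dna):
    """RNA transcribed from an insert placed in antisense orientation.

    `sense_dna` is the coding (mRNA-like) strand written 5' to 3'. The
    antisense transcript is its reverse complement with T replaced by U,
    so it can base-pair with the corresponding mRNA.
    """
    return reverse_complement(sense_dna).upper().replace("T", "U")


# Hypothetical six-base fragment encoding Met-Val.
print(antisense_rna("ATGGTC"))
```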

The probes and primers based on the Zmax1 and HBM gene sequences disclosed 
herein are used to identify homologous Zmax1 and HBM gene sequences and proteins in 
other species. These Zmax1 and HBM gene sequences and proteins are used in the 
diagnostic/prognostic, therapeutic and drug screening methods described herein for the 
species from which they have been isolated. 

XVI. Protein Expression and Purification 

Expression and purification of the HBM protein of the invention can be performed 
essentially as outlined below. To facilitate the cloning, expression and purification of 
membrane and secreted protein from the HBM gene, a gene expression system, such as the 
pET System (Novagen), for cloning and expression of recombinant proteins in E. coli was 
selected. Also, a DNA sequence encoding a peptide tag, the His-Tag, was fused to the 3' end 
of DNA sequences of interest to facilitate purification of the recombinant protein products. 
The 3' end was selected for fusion to avoid alteration of any 5' terminal signal sequence. 

Nucleic acids chosen, for example, from the nucleic acids set forth in SEQ ID NOS: 1, 
3 and 5-12 for cloning HBM were prepared by polymerase chain reaction (PCR). Synthetic 
oligonucleotide primers specific for the 5' and 3' ends of the HBM nucleotide sequence were 
designed and purchased from Life Technologies (Gaithersburg, MD). All forward primers 
(specific for the 5' end of the sequence) were designed to include an NcoI cloning site at the 5' 
terminus. These primers were designed to permit initiation of protein translation at the 
methionine residue encoded within the NcoI site followed by a valine residue and the protein 
encoded by the HBM DNA sequence. All reverse primers (specific for the 3' end of the 
sequence) included an EcoRI site at the 5' terminus to permit cloning of the HBM sequence 
into the reading frame of the pET-28b. The pET-28b vector provided a sequence encoding an 
additional 20 carboxyl-terminal amino acids including six histidine residues (at the 
C-terminus), which comprised the histidine affinity tag. 
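
The primer layout described above can be sketched as follows. This is a hedged illustration rather than the primers actually used: the recognition sequences CCATGG (NcoI) and GAATTC (EcoRI) are standard, but the choice of GTC as the valine codon, the annealing length, the function names, and the target sequence are all assumptions, and the sketch does not attempt to adjust the reverse primer for the pET-28b reading frame.

```python
NCOI_SITE = "CCATGG"   # contains the ATG start codon; the trailing G begins the next codon
ECORI_SITE = "GAATTC"

_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Reverse complement of an uppercase DNA string."""
    return dna.translate(_COMPLEMENT)[::-1]


def forward_primer(target, anneal_len=18):
    """NcoI site at the 5' terminus, then 'TC' to complete a GTC (Val) codon,
    then the first `anneal_len` bases of the target. GTC is an assumed choice
    among the valine codons."""
    return NCOI_SITE + "TC" + target[:anneal_len]


def reverse_primer(target, anneal_len=18):
    """EcoRI site at the 5' terminus, then the reverse complement of the last
    `anneal_len` bases of the target."""
    return ECORI_SITE + reverse_complement(target[-anneal_len:])


# Hypothetical target sequence, for illustration only.
target = "GCTGGTAAACCGTTTGACCTGAAAGGTCCGGAA"
print("forward:", forward_primer(target))
print("reverse:", reverse_primer(target))
```

In practice a few extra bases are often added 5' of a restriction site to improve cutting near the end of a PCR product; that refinement is omitted here because the text places the site at the 5' terminus.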

Genomic DNA prepared from the HBM gene was used as the source of template DNA 
for PCR amplification (Ausubel et al, Current Protocols in Molecular Biology, John Wiley & 
Sons (1994)). To amplify a DNA sequence containing the HBM nucleotide sequence, 
genomic DNA (50 ng) was introduced into a reaction vial containing 2 mM MgCl2, 1 µM 
synthetic oligonucleotide primers (forward and reverse primers) complementary to and 
flanking a defined HBM, 0.2 mM of each deoxynucleotide triphosphate (dATP, dGTP, 
dCTP and dTTP) and 2.5 units of heat stable DNA polymerase (AmpliTaq, Roche Molecular 
Systems, Inc., Branchburg, NJ) in a final volume of 100 microliters. 
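
The reaction composition above can be converted into pipetting volumes with the dilution relation C1 x V1 = C2 x V2. The sketch below is illustrative only: every stock concentration is an assumption, the primer concentration is taken to be 1 µM, and the remaining volume is reported as a single figure because the text does not specify the reaction buffer.

```python
FINAL_VOLUME_UL = 100.0


def stock_volume_ul(final_conc, stock_conc, final_volume_ul=FINAL_VOLUME_UL):
    """Volume of stock needed so that the final reaction is at `final_conc`.

    Both concentrations must be in the same unit (C1 * V1 = C2 * V2).
    """
    return final_conc * final_volume_ul / stock_conc


# (final concentration, ASSUMED stock concentration), same unit within each pair.
components = {
    "MgCl2 (mM)": (2.0, 25.0),
    "forward primer (uM)": (1.0, 10.0),
    "reverse primer (uM)": (1.0, 10.0),
    "dNTP mix, each (mM)": (0.2, 10.0),
}

volumes = {name: stock_volume_ul(f, s) for name, (f, s) in components.items()}

# Fixed amounts rather than concentrations; stock strengths are assumptions.
volumes["DNA polymerase (2.5 U at 5 U/ul)"] = 2.5 / 5.0
volumes["genomic DNA (50 ng at 10 ng/ul)"] = 50.0 / 10.0

# Whatever remains is water plus any reaction buffer (not specified in the text).
volumes["water and buffer to volume"] = FINAL_VOLUME_UL - sum(volumes.values())

for name, vol in volumes.items():
    print(f"{name:36s} {vol:6.1f} ul")
```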

Upon completion of thermal cycling reactions, each sample of amplified DNA was 
purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, MD). All 
amplified DNA samples were subjected to digestion with the restriction endonucleases, e.g., 
NcoI and EcoRI (New England BioLabs, Beverly, MA) (Ausubel et al, Current Protocols in 
Molecular Biology, John Wiley & Sons, Inc. (1994)). DNA samples were then subjected to 
electrophoresis on 1.0% NuSieve (FMC BioProducts, Rockland, ME) agarose gels. DNA 
was visualized by exposure to ethidium bromide and long wave UV irradiation. DNA 
contained in slices isolated from the agarose gel was purified using the Bio 101 GeneClean 
Kit protocol (Bio 101, Vista, CA). 

The pET-28b vector was prepared for cloning by digestion with restriction 
endonucleases, e.g., NcoI and EcoRI (New England BioLabs, Beverly, MA) (Ausubel et al, 
Current Protocols in Molecular Biology, John Wiley & Sons, Inc. (1994)). The pET-28a 
vector, which encodes the histidine affinity tag that can be fused to the 5' end of an inserted 
gene, was prepared by digestion with appropriate restriction endonucleases. 

Following digestion, DNA inserts were cloned (Ausubel et al, Current Protocols in 
Molecular Biology, John Wiley & Sons, Inc. (1994)) into the previously digested pET-28b 
expression vector. Products of the ligation reaction were then used to transform the BL21 
strain of E. coli (Ausubel et al, Current Protocols in Molecular Biology, John Wiley & Sons, 
Inc. (1994)) as described below. 

Competent bacteria, E. coli strain BL21 or E. coli strain BL21(DE3), were 
transformed with recombinant pET expression plasmids carrying the cloned HBM sequence 
according to standard methods (Ausubel et al, Current Protocols in Molecular Biology, John 
Wiley & Sons, Inc. (1994)). Briefly, 1 µl of ligation reaction was mixed with 50 µl of 
electrocompetent cells and subjected to a high voltage pulse, after which samples were 
incubated in 0.45 ml SOC medium (0.5% yeast extract, 2.0% tryptone, 10 mM NaCl, 2.5 mM 
KCl, 10 mM MgCl2, 10 mM MgSO4 and 20 mM glucose) at 37°C with shaking for 1 hour. 
Samples were then spread on LB agar plates containing 25 µg/ml kanamycin sulfate for 
growth overnight. Transformed colonies of BL21 were then picked and analyzed to evaluate 
cloned inserts, as described below. 

Individual BL21 clones transformed with recombinant pET-28b HBM nucleotide 
sequences were analyzed by PCR amplification of the cloned inserts using the same forward 
and reverse primers specific for the HBM sequences that were used in the original PCR 
amplification cloning reactions. Successful amplification verifies the integration of the HBM 
sequence in the expression vector (Ausubel et al, Current Protocols in Molecular Biology, 
John Wiley & Sons, Inc. (1994)). 

Individual clones of recombinant pET-28b vectors carrying properly cloned HBM 
nucleotide sequences were picked and incubated in 5 ml of LB broth plus 25 µg/ml 
kanamycin sulfate overnight. The following day plasmid DNA was isolated and purified 
using the Qiagen plasmid purification protocol (Qiagen Inc., Chatsworth, CA). 

The pET vector can be propagated in any E. coli K-12 strain, e.g., HMS174, HB101, 
JM109, DH5 and the like, for purposes of cloning or plasmid preparation. Hosts for 
expression include E. coli strains containing a chromosomal copy of the gene for T7 RNA 
polymerase. These hosts are lysogens of bacteriophage DE3, a lambda derivative that 
carries the lacI gene, the lacUV5 promoter and the gene for T7 RNA polymerase. T7 RNA 
polymerase is induced by addition of isopropyl-β-D-thiogalactoside (IPTG), and the T7 
RNA polymerase transcribes any target plasmid containing a functional T7 promoter, such as 
pET-28b, carrying its gene of interest. Strains include, for example, BL21(DE3) (Studier et 
al, Meth. Enzymol, 185:60-89 (1990)). 

To express the recombinant HBM sequence, 50 ng of plasmid DNA isolated as 
described above are used to transform competent BL21(DE3) bacteria as described above 
(provided by Novagen as part of the pET expression kit). The lacZ gene (β-galactosidase) is 
expressed in the pET System as described for the HBM recombinant constructions. Transformed 
cells were cultured in SOC medium for 1 hour, and the culture was then plated on LB plates 
containing 25 µg/ml kanamycin sulfate. The following day, the bacterial colonies were 
pooled and grown in LB medium containing kanamycin sulfate (25 µg/ml) to an optical 
density at 600 nm of 0.5 to 1.0 O.D. units, at which point 1 mM IPTG was added to the 
culture for 3 hours to induce gene expression of the HBM recombinant DNA constructions. 

After induction of gene expression with IPTG, bacteria were collected by 
centrifugation in a Sorvall RC-3B centrifuge at 3500 x g for 15 minutes at 4°C. Pellets were 
resuspended in 50 ml of cold mM Tris-HCl, pH 8.0, 0.1 M NaCl and 0.1 mM EDTA (STE 
buffer). Cells were then centrifuged at 2000 x g for 20 minutes at 4°C. Wet pellets were 
weighed and frozen at -80°C until ready for protein purification. 

A variety of methodologies known in the art can be used to purify the isolated 
proteins (Coligan et al, Current Protocols in Protein Science, John Wiley & Sons (1995)). 
For example, the frozen cells can be thawed, resuspended in buffer and ruptured by several 
passages through a small volume microfluidizer (Model M-110S, Microfluidics International 
Corp., Newton, MA). The resultant homogenate is centrifuged to yield a clear supernatant 
(crude extract) and, following filtration, the crude extract is fractionated over columns. 
Fractions are monitored by absorbance at 280 nm and peak fractions may be analyzed by 
SDS-PAGE. 

The concentrations of purified protein preparations are quantified 
spectrophotometrically using absorbance coefficients calculated from amino acid content 
(Perkins, Eur. J. Biochem., 157:169-180 (1986)). Protein concentrations are also measured 
by the method of Bradford, Anal. Biochem., 72:248-254 (1976) and Lowry et al, J. Biol. 
Chem., 193:265-275 (1951) using bovine serum albumin as a standard. 
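
The spectrophotometric quantification above rests on the Beer-Lambert relation A = ε x c x l, with ε estimated from the numbers of absorbing residues in the sequence. The following is a minimal sketch under stated assumptions: the per-residue coefficients are commonly used approximate values at 280 nm and are not necessarily those of the cited reference, and the function names and example sequence are hypothetical.

```python
# ASSUMED approximate molar absorbances at 280 nm (M^-1 cm^-1); the cited
# reference may use somewhat different values.
EXT_TRP = 5500.0
EXT_TYR = 1490.0
EXT_CYSTINE = 125.0  # per disulfide bond


def molar_extinction_280(sequence, n_disulfides=0):
    """Estimate the molar extinction coefficient from amino acid content."""
    seq = sequence.upper()
    return (
        seq.count("W") * EXT_TRP
        + seq.count("Y") * EXT_TYR
        + n_disulfides * EXT_CYSTINE
    )


def molar_concentration(a280, sequence, path_cm=1.0, n_disulfides=0):
    """Beer-Lambert: c = A / (epsilon * l), returned in mol/l."""
    epsilon = molar_extinction_280(sequence, n_disulfides)
    if epsilon == 0:
        raise ValueError("sequence has no Trp, Tyr or cystine; A280 is not usable")
    return a280 / (epsilon * path_cm)


# Hypothetical peptide with one Trp and two Tyr residues.
peptide = "MWYYK"
print(molar_extinction_280(peptide))
print(molar_concentration(0.848, peptide))
```

Because estimates of this kind depend on the coefficients chosen, the text's use of independent dye-binding and Lowry measurements against a bovine serum albumin standard is a sensible cross-check.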

SDS-polyacrylamide gels of various concentrations were purchased from BioRad 
(Hercules, CA), and stained with Coomassie blue. Molecular weight markers may include 
rabbit skeletal muscle myosin (200 kDa), E. coli β-galactosidase (116 kDa), rabbit muscle 
phosphorylase B (97.4 kDa), bovine serum albumin (66.2 kDa), ovalbumin (45 kDa), bovine 
carbonic anhydrase (31 kDa), soybean trypsin inhibitor (21.5 kDa), egg white lysozyme (14.4 
kDa) and bovine aprotinin (6.5 kDa). 
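
Markers such as these are typically used to estimate the size of an unknown band by fitting log10(molecular weight) against relative mobility (Rf). The sketch below shows one simple way to do this with an ordinary least-squares line; the marker masses come from the list above, but the Rf values are invented for illustration, and a single straight line is only an approximation over a wide mass range.

```python
import math


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def estimate_mw_kda(rf_unknown, marker_rf, marker_kda):
    """Estimate molecular weight (kDa) of a band from its relative mobility."""
    slope, intercept = fit_line(marker_rf, [math.log10(m) for m in marker_kda])
    return 10 ** (slope * rf_unknown + intercept)


# Marker masses from the text; the Rf values are HYPOTHETICAL readings.
marker_kda = [200.0, 116.0, 97.4, 66.2, 45.0, 31.0, 21.5, 14.4, 6.5]
marker_rf = [0.08, 0.20, 0.25, 0.35, 0.46, 0.57, 0.68, 0.80, 0.95]

print(f"band at Rf 0.40 is roughly {estimate_mw_kda(0.40, marker_rf, marker_kda):.0f} kDa")
```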

Once a sufficient quantity of the desired protein has been obtained, it may be used for 
various purposes. A typical use is the production of antibodies specific for binding. These 
antibodies may be either polyclonal or monoclonal, and may be produced by in vitro or in 
vivo techniques well known in the art. Monoclonal antibodies to epitopes of any of the 
peptides identified and isolated as described can be prepared from murine hybridomas 
(Kohler, Nature, 256:495 (1975)). In summary, a mouse is inoculated with a few micrograms 
of HBM protein over a period of two weeks. The mouse is then sacrificed. The cells that 
produce antibodies are then removed from the mouse's spleen. The spleen cells are then 
fused with mouse myeloma cells using polyethylene glycol. The successfully fused cells are 
diluted in a microtiter plate and growth of the culture is continued. The amount of antibody 
per well is measured by immunoassay methods such as ELISA (Engvall, Meth. Enzymol, 
70:419 (1980)). Clones producing antibody can be expanded and further propagated to 
produce HBM antibodies. Other suitable techniques involve in vitro exposure of 
lymphocytes to the antigenic polypeptides, or alternatively, selection of libraries of 
antibodies in phage or similar vectors. See Huse et al, Science, 246:1275-1281 (1989). For 
additional information on antibody production see Davis et al, Basic Methods in Molecular 
Biology, Elsevier, NY, Section 21-2 (1989). 

XVII. Methods of Use: Gene Therapy 

In recent years, significant technological advances have been made in the area of gene 
therapy for both genetic and acquired diseases (Kay et al, Proc. Natl. Acad. Sci. USA, 
94:12744-12746 (1997)). Gene therapy can be defined as the deliberate transfer of DNA for 
therapeutic purposes. Improvement in gene transfer methods has allowed for development of 
gene therapy protocols for the treatment of diverse types of diseases. Gene therapy has also 
taken advantage of recent advances in the identification of new therapeutic genes, 
improvement in both viral and nonviral gene delivery systems, better understanding of gene 
regulation, and improvement in cell isolation and transplantation. 

The preceding experiments identify the HBM gene as a dominant mutation conferring 
elevated bone mass. The fact that this mutation is dominant indicates that expression of the 
HBM protein causes elevated bone mass. Older individuals carrying the HBM gene, and, 
therefore, expressing the HBM protein, do not suffer from osteoporosis. These individuals are 
equivalent to individuals being treated with the HBM protein. These observations are a 
strong experimental indication that therapeutic treatment with the HBM protein prevents 
osteoporosis. The bone mass elevating activity of the HBM gene is termed "HBM function." 

Therefore, according to the present invention, a method is also provided of supplying 
HBM function to mesenchymal stem cells (Onyia et al, J. Bone Miner. Res., 13:20-30 (1998); 
Ko et al, Cancer Res., 56:4614-4619 (1996)). Supplying such a function provides protection 
against osteoporosis. The HBM gene or a part of the gene may be introduced into the cell in 
a vector such that the gene remains extrachromosomal. In such a situation, the gene will be 
expressed by the cell from the extrachromosomal location. 

Vectors for introduction of genes both for recombination and for extrachromosomal 
maintenance are known in the art, and any suitable vector may be used. Methods for 
introducing DNA into cells such as electroporation, calcium phosphate co-precipitation, and 
viral transduction are known in the art, and the choice of method is within the competence of 
one skilled in the art (Robbins, Ed., Gene Therapy Protocols, Humana Press, NJ (1997)). 
Cells transformed with the HBM gene can be used as model systems to study osteoporosis 
and drug treatments that promote bone growth. 

As generally discussed above, the HBM gene or fragment, where applicable, may be 
used in gene therapy methods in order to increase the amount of the expression products of 
such genes in mesenchymal stem cells. It may be useful also to increase the level of 
expression of a given HBM protein, or a fragment thereof, even in those cells in which the 
wild type gene is expressed normally. Gene therapy would be carried out according to 
generally accepted methods as described by, for example, Friedman, Therapy for Genetic 
Diseases, Friedman, Ed., Oxford University Press, pages 105-121 (1991). 

A virus or plasmid vector containing a copy of the HBM gene linked to expression 
control elements and capable of replicating inside mesenchymal stem cells is prepared. 
Suitable vectors are known and described, for example, in U.S. Patent No. 5,252,479 and WO 
93/07282, the disclosures of which are incorporated by reference herein in their entirety. The 
vector is then injected into the patient, either locally into the bone marrow or systemically (in 
order to reach any mesenchymal stem cells located at other sites, i.e., in the blood). If the 
transfected gene is not permanently incorporated into the genome of each of the targeted 
cells, the treatment may have to be repeated periodically. 

Gene transfer systems known in the art may be useful in the practice of the gene 
therapy methods of the present invention. These include viral and non-viral transfer methods. 
A number of viruses have been used as gene transfer vectors, including polyoma, i.e., SV40 
(Madzak et al, J. Gen. Virol, 73:1533-1536 (1992)), adenovirus (Berkner, Curr. Top. 
Microbiol. Immunol, 158:39-61 (1992); Berkner et al, BioTechniques, 6:616-629 (1988); 
Gorziglia et al, J. Virol, 66:4407-4412 (1992); Quantin et al, Proc. Natl. Acad. Sci. USA, 
89:2581-2584 (1992); Rosenfeld et al, Cell, 68:143-155 (1992); Wilkinson et al, Nucl. Acids 
Res., 20:2233-2239 (1992); Stratford-Perricaudet et al, Hum. Gene Ther., 1:241-256 (1990)), 
vaccinia virus (Mackett et al, Biotechnology, 24:495-499 (1992)), adeno-associated virus 
(Muzyczka, Curr. Top. Microbiol. Immunol, 158:91-123 (1992); Ohi et al, Gene, 89:279-282 
(1990)), herpes viruses including HSV and EBV (Margolskee, Curr. Top. Microbiol. 
Immunol, 158:67-90 (1992); Johnson et al, J. Virol, 66:2952-2965 (1992); Fink et al, Hum. 
Gene Ther., 3:11-19 (1992); Breakfield et al, Mol. Neurobiol, 1:337-371 (1987); Fresse et al, 
Biochem. Pharmacol, 40:2189-2199 (1990)), and retroviruses of avian (Brandyopadhyay et 
al, Mol. Cell Biol, 4:749-754 (1984); Petropouplos et al, J. Virol, 66:3391-3397 (1992)), 
murine (Miller, Curr. Top. Microbiol. Immunol, 158:1-24 (1992); Miller et al, Mol. Cell 
Biol, 5:431-437 (1985); Sorge et al, Mol. Cell Biol, 4:1730-1737 (1984); Mann et al, J. 
Virol, 54:401-407 (1985)), and human origin (Page et al, J. Virol, 64:5370-5276 (1990); 
Buchschalcher et al, J. Virol, 66:2731-2739 (1992)). Most human gene therapy protocols 
have been based on disabled murine retroviruses. 

Non-viral gene transfer methods known in the art include chemical techniques such as 
calcium phosphate coprecipitation (Graham et al, Virology, 52:456-467 (1973); Pellicer et al, 
Science, 209:1414-1422 (1980)), mechanical techniques, for example microinjection 
(Anderson et al, Proc. Natl. Acad. Sci. USA, 77:5399-5403 (1980); Gordon et al, Proc. Natl. 
Acad. Sci. USA, 77:7380-7384 (1980); Brinster et al, Cell, 27:223-231 (1981); Constantini et 
al, Nature, 294:92-94 (1981)), membrane fusion-mediated transfer via liposomes (Felgner et 
al, Proc. Natl. Acad. Sci. USA, 84:7413-7417 (1987); Wang et al, Biochemistry, 
28:9508-9514 (1989); Kaneda et al, J. Biol. Chem., 264:12126-12129 (1989); Stewart et al, Hum. 
Gene Ther., 3:267-275 (1992); Nabel et al, Science, 249:1285-1288 (1990); Lim et al, 
Circulation, 83:2007-2011 (1992)), and direct DNA uptake and receptor-mediated DNA 
transfer (Wolff et al, Science, 247:1465-1468 (1990); Wu et al, BioTechniques, 11:474-485 
(1991); Zenke et al, Proc. Natl. Acad. Sci. USA, 87:3655-3659 (1990); Wu et al, J. Biol. 
Chem., 264:16985-16987 (1989); Wolff et al, BioTechniques, 11:474-485 (1991); Wagner et 
al, 1990; Wagner et al, Proc. Natl. Acad. Sci. USA, 88:4255-4259 (1991); Cotten et al, Proc. 
Natl. Acad. Sci. USA, 87:4033-4037 (1990); Curiel et al, Proc. Natl. Acad. Sci. USA, 
88:8850-8854 (1991); Curiel et al, Hum. Gene Ther., 3:147-154 (1991)). Viral-mediated 
gene transfer can be combined with direct in vivo gene transfer so that the vectors are 
directed to the mesenchymal stem cells and 
not into the surrounding cells (Romano et al, In Vivo, 12(1):59-67 (1998); Gonez et al, Hum. 
Mol. Genetics, 7(12):1913-9 (1998)). Alternatively, the retroviral vector producer cell line 
can be injected into the bone marrow (Culver et al, Science, 256:1550-1552 (1992)). 
Injection of producer cells would then provide a continuous source of vector particles. This 
technique has been approved for use in humans with inoperable brain tumors. 

In an approach which combines biological and physical gene transfer methods, 
plasmid DNA of any size is combined with a polylysine-conjugated antibody specific to the 
adenovirus hexon protein, and the resulting complex is bound to an adenovirus vector. The 
trimolecular complex is then used to infect cells. The adenovirus vector permits efficient 
binding, internalization, and degradation of the endosome before the coupled DNA is 
damaged. 

Liposome/DNA complexes have been shown to be capable of mediating direct in vivo 
gene transfer. While in standard liposome preparations the gene transfer process is non- 
specific, localized in vivo uptake and expression have been reported in tumor deposits, for 
example, following direct in situ administration (Nabel, Hum. Gene Ther., 3:399-410 (1992)). 

XVIII. Methods of Use: Transformed Hosts, Development of Pharmaceuticals and Research Tools 

Cells and animals that carry the HBM gene can be used as model systems to study and 
test for substances that have potential as therapeutic agents (Onyia et al, J. Bone Miner. Res., 
13:20-30 (1998); Broder et al, Bone, 21:225-235 (1997)). The cells are typically cultured 
mesenchymal stem cells. These may be isolated from individuals with somatic or germline 
HBM genes. Alternatively, the cell line can be engineered to carry the HBM gene, as 
described above. After a test substance is applied to the cells, the transformed phenotype of 
the cell is determined. Any trait of transformed cells can be assessed, including formation of 
bone matrix in culture (Broder et al, Bone, 21:225-235 (1997)), mechanical properties (Kizer 
et al, Proc. Natl. Acad. Sci. USA, 94:1013-1018 (1997)), and response to application of 
putative therapeutic agents. 

Animals for testing therapeutic agents can be selected after treatment of germline cells 
or zygotes. Such treatments include insertion of the Zmax1 gene, as well as insertion of the 
HBM gene and disrupted homologous genes. Alternatively, the inserted Zmax1 gene(s) 
and/or HBM gene(s) of the animals may be disrupted by insertion or deletion mutation or 
other genetic alterations using conventional techniques, such as those described by, for 
example, Capechi, Science, 244:1288 (1989); Valancuis et al, Mol. Cell Biol, 11:1402 
(1991); Hasty et al, Nature, 350:243 (1991); Shinkai et al, Cell, 68:855 (1992); Mombaerts et 
al, Cell, 68:869 (1992); Philpott et al, Science, 256:1448 (1992); Snouwaert et al, Science, 
257:1083 (1992); Donehower et al, Nature, 356:215 (1992). After test substances have been 
administered to the animals, the growth of bone must be assessed. If the test substance 
enhances the growth of bone, then the test substance is a candidate therapeutic agent. These 
animal models provide an extremely important vehicle for potential therapeutic products. 

Individuals carrying the HBM gene have elevated bone mass. The HBM gene causes 
this phenotype by altering the activities, levels, expression patterns, and modification states of 
other molecules involved in bone development. Using a variety of established techniques, it 
is possible to identify molecules, preferably proteins or mRNAs, whose activities, levels, 
expression patterns, and modification states are different between systems containing the 
Zmax1 gene and systems containing the HBM gene. Such systems can be, for example, 
cell-free extracts, cells, tissues or living organisms, such as mice or humans. For a mutant form of 
Zmax1, a complete deletion of Zmax1, mutations lacking the extracellular or intracellular 
portion of the protein, or any other mutation in the Zmax1 gene may be used. It is also 
possible to use expression of antisense Zmax1 RNA or oligonucleotides to inhibit production 
of the Zmax1 protein. For a mutant form of HBM, a complete deletion of HBM, mutations 
lacking the extracellular or intracellular portion of the HBM protein, or any other mutation in 
the HBM gene may be used. It is also possible to use expression of antisense HBM RNA or 
oligonucleotides to inhibit production of the HBM protein. 

Molecules identified by comparison of Zmax1 systems and HBM systems can be used
as surrogate markers in pharmaceutical development or in diagnosis of human or animal bone
disease. Alternatively, such molecules may be used in treatment of bone disease. See
Schena et al, Science, 270:467-470 (1995).

For example, a transgenic mouse carrying the HBM gene in the mouse homologue is
constructed. A mouse of the genotype HBM/+ is viable, healthy and has elevated bone mass.
To identify surrogate markers for elevated bone mass, HBM/+ (i.e., heterozygous) and
isogenic +/+ (i.e., wild-type) mice are sacrificed. Bone tissue mRNA is extracted from each
animal, and a "gene chip" corresponding to mRNAs expressed in the +/+ individual is
constructed. mRNA from different tissues is isolated from animals of each genotype,
reverse-transcribed, fluorescently labeled, and then hybridized to gene fragments affixed to a solid
support. The ratio of fluorescent intensity between the two populations is indicative of the
relative abundance of the specific mRNAs in the +/+ and HBM/+ animals. Genes encoding
mRNAs over- and under-expressed relative to the wild-type control are candidates for genes
coordinately regulated by the HBM gene.
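
The intensity-ratio comparison described above can be illustrated with a minimal sketch. The two-fold cutoff, the function name and the spot names below are assumptions made for illustration only; they are not taken from this specification, and real array data would also need background correction and normalization.

```python
import math

def classify_expression(wild_type, hbm, fold_cutoff=2.0):
    """Return {gene: (log2_ratio, label)} from paired fluorescence intensities.

    wild_type and hbm map a spot name to its intensity in the +/+ and
    HBM/+ samples. The two-fold cutoff is an assumed, illustrative value.
    """
    threshold = math.log2(fold_cutoff)
    calls = {}
    for gene, wt_signal in wild_type.items():
        hbm_signal = hbm[gene]
        if wt_signal <= 0 or hbm_signal <= 0:
            continue  # skip spots without usable signal
        log_ratio = math.log2(hbm_signal / wt_signal)
        if log_ratio >= threshold:
            label = "over-expressed"
        elif log_ratio <= -threshold:
            label = "under-expressed"
        else:
            label = "unchanged"
        calls[gene] = (log_ratio, label)
    return calls

# Hypothetical intensities for three placeholder spots (not measured data)
wild_type = {"gene_a": 100.0, "gene_b": 400.0, "gene_c": 250.0}
hbm_het = {"gene_a": 400.0, "gene_b": 100.0, "gene_c": 260.0}
calls = classify_expression(wild_type, hbm_het)
```

Working on a log2 scale keeps over- and under-expression symmetric around zero, which is why the same threshold is applied with opposite signs.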

One standard procedure for identification of new proteins that are part of the same
signaling cascade as an already-discovered protein is as follows. Cells are treated with
radioactive phosphorus, and the already-discovered protein is manipulated to be more or
less active. The phosphorylation state of other proteins in the cell is then monitored by
polyacrylamide gel electrophoresis and autoradiography, or similar techniques. Levels of
activity of the known protein may be manipulated by many methods, including, for example,
comparing wild-type and mutant proteins, using specific inhibitors such as drugs or antibodies,
simply adding or not adding a known extracellular protein, or using antisense inhibition of the
expression of the known protein (Tamura et al, Science, 280(5369):1614-7 (1998); Meng,
EMBO J., 17(15):4391-403 (1998); Cooper et al, Cell, 1:263-73 (1982)).

In another example, proteins with different levels of phosphorylation are identified in
TE85 osteosarcoma cells expressing either a sense or antisense cDNA for Zmax1. TE85 cells
normally express high levels of Zmax1 (Dong et al, Biochem. & Biophys. Res. Comm.,
251:784-790 (1998)). Cells containing the sense construct express even higher levels of
Zmax1, while cells expressing the antisense construct express lower levels. Cells are grown
in the presence of 32P, harvested, lysed, and the lysates run on SDS polyacrylamide gels to
separate proteins, and the gels subjected to autoradiography (Ausubel et al, Current Protocols
in Molecular Biology, John Wiley & Sons (1997)). Bands that differ in intensity between the
sense and antisense cell lines represent phosphoproteins whose phosphorylation state or
absolute level varies in response to levels of Zmax1. As an alternative to the 32P-labeling,
unlabeled proteins may be separated by SDS-PAGE and subjected to immunoblotting, using
the commercially available anti-phosphotyrosine antibody as a probe (Thomas et al, Nature,
376(6537):267-71 (1995)). As an alternative to the expression of antisense RNA, transfection
with chemically modified antisense oligonucleotides can be used (Woolf et al, Nucleic Acids
Res., 18(7):1763-9 (1990)).

Many bone disorders, such as osteoporosis, have a slow onset and a slow response to
treatment. It is therefore useful to develop surrogate markers for bone development and
mineralization. Such markers can be useful in developing treatments for bone disorders, and
for diagnosing patients who may be at risk for later development of bone disorders.
Examples of preferred markers are N- and C-terminal telopeptide markers described, for
example, in U.S. Patent Nos. 5,455,179, 5,641,837 and 5,652,112, the disclosures of which
are incorporated by reference herein in their entirety. In the area of HIV disease, CD4 counts
and viral load are useful surrogate markers for disease progression (Vlahov et al, JAMA,
279(1):35-40 (1998)). There is a need for analogous surrogate markers in the area of bone
disease.

A surrogate marker can be any characteristic that is easily tested and relatively
insensitive to non-specific influences. For example, a surrogate marker can be a molecule
such as a protein or mRNA in a tissue or in blood serum. Alternatively, a surrogate marker
may be a diagnostic sign such as sensitivity to pain, a reflex response or the like.

In yet another example, surrogate markers for elevated bone mass are identified using
a pedigree of humans carrying the HBM gene. Blood samples are withdrawn from three
individuals that carry the HBM gene, and from three closely related individuals that do not.
Proteins in the serum from these individuals are electrophoresed on a two dimensional gel
system, in which one dimension separates proteins by size, and another dimension separates
proteins by isoelectric point (Epstein et al, Electrophoresis, 17(11):1655-70 (1996)). Spots
corresponding to proteins are identified. A few spots are expected to be present in different
amounts or in slightly different positions for the HBM individuals compared to their normal
relatives. These spots correspond to proteins that are candidate surrogate markers. The
identities of the proteins are determined by microsequencing, and antibodies to the proteins
can be produced by standard methods for use in diagnostic testing procedures. Diagnostic
assays for HBM proteins or other candidate surrogate markers include using antibodies
described in this invention and a reporter molecule to detect HBM in human body fluids,
membranes, bones, cells, tissues or extracts thereof. The antibodies can be labeled by joining
them covalently or noncovalently with a substance that provides a detectable signal. A
variety of reporter molecules or labels are described in the scientific and patent literature,
including radionuclides, enzymes, and fluorescent, chemiluminescent or chromogenic agents
(U.S. Patent Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; and
4,366,241).

Using these antibodies, the levels of candidate surrogate markers are measured in
normal individuals and in patients suffering from a bone disorder, such as osteoporosis,
osteoporosis pseudoglioma, Engelmann's disease, Ribbing's disease, hyperphosphatasemia,
Van Buchem's disease, melorheostosis, osteopetrosis, pycnodysostosis, sclerosteosis,
osteopoikilosis, acromegaly, Paget's disease, fibrous dysplasia, tubular stenosis, osteogenesis
imperfecta, hypoparathyroidism, pseudohypoparathyroidism,
pseudopseudohypoparathyroidism, primary and secondary hyperparathyroidism and
associated syndromes, hypercalciuria, medullary carcinoma of the thyroid gland,
osteomalacia and other diseases. Techniques for measuring levels of protein in serum in a
clinical setting using antibodies are well established. A protein that is consistently present in
higher or lower levels in individuals carrying a particular disease or type of disease is a useful
surrogate marker.

A surrogate marker can be used in diagnosis of a bone disorder. For example,
consider a child who presents to a physician with a high frequency of bone fracture. The
underlying cause may be child abuse, inappropriate behavior by the child, or a bone disorder.
To rapidly test for a bone disorder, the levels of the surrogate marker protein are measured
using the antibody described above.

Levels or modification states of surrogate markers can be measured as indicators of
the likely effectiveness of a drug that is being developed. It is especially convenient to use
surrogate markers in creating treatments for bone disorders, because alterations in bone
development or mineralization may require a long time to be observed. For example, a set of
bone mRNAs, termed the "HBM-inducible mRNA set," is found to be overexpressed in
HBM/+ mice as compared to +/+ mice, as described above. Expression of this set can be
used as a surrogate marker. Specifically, if treatment of +/+ mice with a compound results in
overexpression of the HBM-inducible mRNA set, then that compound is considered a
promising candidate for further development.

This invention is particularly useful for screening compounds by using the Zmax1 or
HBM protein or binding fragment thereof in any of a variety of drug screening techniques.
The Zmax1 or HBM protein or fragment employed in such a test may either be free in
solution, affixed to a solid support, or borne on a cell surface. One method of drug screening
utilizes eukaryotic or prokaryotic host cells which are stably transformed with recombinant
nucleic acids expressing the protein or fragment, preferably in competitive binding assays.
Such cells, either in viable or fixed form, can be used for standard binding assays. One may
measure, for example, the formation of complexes between a Zmax1 or HBM protein or
fragment and the agent being tested, or examine the degree to which the formation of a
complex between a Zmax1 or HBM protein or fragment and a known ligand is interfered with
by the agent being tested.

Thus, the present invention provides methods of screening for drugs comprising
contacting such an agent with a Zmax1 or HBM protein or fragment thereof and assaying (i)
for the presence of a complex between the agent and the Zmax1 or HBM protein or fragment,
or (ii) for the presence of a complex between the Zmax1 or HBM protein or fragment and a
ligand, by methods well known in the art. In such competitive binding assays the Zmax1 or
HBM protein or fragment is typically labeled. Free Zmax1 or HBM protein or fragment is
separated from that present in a protein:protein complex, and the amount of free (i.e.,
uncomplexed) label is a measure of the binding of the agent being tested to Zmax1 or HBM
or its interference with Zmax1 or HBM:ligand binding, respectively.

Another technique for drug screening provides high throughput screening for
compounds having suitable binding affinity to the Zmax1 or HBM proteins and is described
in detail in WO 84/03564. Briefly stated, large numbers of different small peptide test
compounds are synthesized on a solid substrate, such as plastic pins or some other surface.
The peptide test compounds are reacted with Zmax1 or HBM proteins and washed. Bound
Zmax1 or HBM protein is then detected by methods well known in the art. Purified Zmax1
or HBM can be coated directly onto plates for use in the aforementioned drug screening
techniques. Alternatively, non-neutralizing antibodies to the protein can be used as capture
antibodies to immobilize the Zmax1 or HBM protein on the solid phase.

This invention also contemplates the use of competitive drug screening assays in
which neutralizing antibodies capable of specifically binding the Zmax1 or HBM protein
compete with a test compound for binding to the Zmax1 or HBM protein or fragments
thereof. In this manner, the antibodies can be used to detect the presence of any peptide that
shares one or more antigenic determinants of the Zmax1 or HBM protein.

A further technique for drug screening involves the use of host eukaryotic cell lines or
cells (such as described above) that have a nonfunctional Zmax1 or HBM gene. These host
cell lines or cells are defective at the Zmax1 or HBM protein level. The host cell lines or
cells are grown in the presence of drug compound. The rate of growth of the host cells is
measured to determine if the compound is capable of regulating the growth of Zmax1 or
HBM defective cells.

The goal of rational drug design is to produce structural analogs of biologically active
proteins of interest or of small molecules with which they interact (e.g., agonists, antagonists,
inhibitors) in order to fashion drugs which are, for example, more active or stable forms of
the protein, or which, e.g., enhance or interfere with the function of a protein in vivo. See,
e.g., Hodgson, Bio/Technology, 9:19-21 (1991). In one approach, one first determines the
three-dimensional structure of a protein of interest (e.g., Zmax1 or HBM protein) or, for
example, of the Zmax1- or HBM-receptor or ligand complex, by x-ray crystallography, by
computer modeling or, most typically, by a combination of approaches. Less often, useful
information regarding the structure of a protein may be gained by modeling based on the
structure of homologous proteins. An example of rational drug design is the development of
HIV protease inhibitors (Erickson et al, Science, 249:527-533 (1990)). In addition, peptides
(e.g., Zmax1 or HBM protein) are analyzed by an alanine scan (Wells, Methods in Enzymol.,
202:390-411 (1991)). In this technique, an amino acid residue is replaced by Ala, and its
effect on the peptide's activity is determined. Each of the amino acid residues of the peptide is
analyzed in this manner to determine the important regions of the peptide.
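
As an illustration of the alanine scan just described, the following minimal sketch enumerates the single-substitution variants that would be made for a peptide. The function name and the five-residue test peptide are hypothetical placeholders, not sequences from this specification, and skipping residues that are already alanine is an assumed convention.

```python
def alanine_scan(peptide):
    """Yield (position, original_residue, variant) for each single Ala substitution.

    Positions are 1-based. Residues that are already alanine are skipped,
    since replacing them would leave the peptide unchanged.
    """
    for index, residue in enumerate(peptide):
        if residue == "A":
            continue
        variant = peptide[:index] + "A" + peptide[index + 1:]
        yield index + 1, residue, variant

# Hypothetical placeholder peptide in one-letter amino acid code
variants = list(alanine_scan("MKAGV"))
for position, residue, variant in variants:
    print(f"{residue}{position}A: {variant}")
```

Each listed variant would then be synthesized or expressed and its activity compared with that of the unmodified peptide.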

It is also possible to isolate a target-specific antibody, selected by a functional assay,
and then to solve its crystal structure. In principle, this approach yields a pharmacore upon
which subsequent drug design can be based. It is possible to bypass protein crystallography
altogether by generating anti-idiotypic antibodies (anti-ids) to a functional, pharmacologically
active antibody. As a mirror image of a mirror image, the binding site of the anti-ids would
be expected to be an analog of the original receptor. The anti-id could then be used to
identify and isolate peptides from banks of chemically or biologically produced
peptides. Selected peptides would then act as the pharmacore.

Thus, one may design drugs which have, e.g., improved Zmax1 or HBM protein
activity or stability or which act as inhibitors, agonists, antagonists, etc. of Zmax1 or HBM
protein activity. By virtue of the availability of cloned Zmax1 or HBM sequences, sufficient
amounts of the Zmax1 or HBM protein may be made available to perform such analytical
studies as x-ray crystallography. In addition, the knowledge of the Zmax1 or HBM protein
sequence provided herein will guide those employing computer modeling techniques in place
of, or in addition to, x-ray crystallography.

XIX. Methods of Use: Avian and Mammalian Animal Husbandry 

The Zmax1 DNA and Zmax1 protein and/or the HBM DNA and HBM protein can be
used as vertebrate, and preferably human, therapeutic agents and as avian and mammalian
veterinary agents, including for livestock breeding. Birds, including, for example, chickens,
roosters, hens, turkeys, ostriches, ducks, pheasants and quails, can benefit from the
identification of the gene and pathway for high bone mass. In many examples cited in the
literature (for example, McCoy et al, Res. Vet. Sci., 60(2):185-186 (1996)), weakened bones
due to husbandry conditions cause cage layer fatigue, osteoporosis and high mortality rates.
Additional therapeutic agents to treat osteoporosis or other bone disorders in birds can have
considerable beneficial effects on avian welfare and the economic conditions of the livestock
industry, including, for example, meat and egg production.

XX. Methods of Use: Diagnostic assays using Zmax1-specific oligonucleotides for
detection of genetic alterations affecting bone development

In cases where an alteration or disease of bone development is suspected to involve an
alteration of the Zmax1 gene or the HBM gene, specific oligonucleotides may be constructed
and used to assess the level of Zmax1 mRNA or HBM mRNA, respectively, in bone tissue or
in another tissue that affects bone development.

For example, to test whether a person has the HBM gene, which affects bone density,
polymerase chain reaction can be used. Two oligonucleotides are synthesized by standard
methods or are obtained from a commercial supplier of custom-made oligonucleotides. The
length and base composition are determined by standard criteria using the Oligo 4.0 primer
picking program (Wojciech Rychlik, 1992). One of the oligonucleotides is designed so that
it will hybridize only to HBM DNA under the PCR conditions used. The other
oligonucleotide is designed to hybridize to a segment of Zmax1 genomic DNA such that
amplification of DNA using these oligonucleotide primers produces a conveniently identified
DNA fragment. For example, the pair of primers CCAAGTTCTGAGAAGTCC (SEQ ID
NO:32) and AATACCTGAAACCATACCTG (SEQ ID NO:33) will amplify a 530 base pair
DNA fragment from a DNA sample when the following conditions are used: step 1 at 95°C for
120 seconds; step 2 at 95°C for 30 seconds; step 3 at 58°C for 30 seconds; step 4 at 72°C for
120 seconds; where steps 2-4 are repeated 35 times. Tissue samples may be obtained from
hair follicles, whole blood, or the buccal cavity.
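
The cycling conditions above can be written down as a small data structure, which makes it easy to check the total programmed hold time. This is only an illustrative sketch: the dictionary layout and function name are assumptions, and the calculation ignores the temperature ramping time of any particular thermocycler.

```python
# Cycling program as stated in the text: (temperature in deg C, seconds)
PCR_PROGRAM = {
    "initial_denaturation": (95, 120),          # step 1
    "cycle": [(95, 30), (58, 30), (72, 120)],   # steps 2-4
    "cycles": 35,                               # repeats of steps 2-4
}

PRIMERS = {
    "SEQ ID NO:32": "CCAAGTTCTGAGAAGTCC",
    "SEQ ID NO:33": "AATACCTGAAACCATACCTG",
}

def total_hold_time(program):
    """Sum the programmed hold times in seconds (ramp times are not included)."""
    per_cycle = sum(seconds for _, seconds in program["cycle"])
    return program["initial_denaturation"][1] + program["cycles"] * per_cycle

seconds = total_hold_time(PCR_PROGRAM)
print(f"{seconds} s of programmed holds ({seconds / 60:.0f} min)")
for name, sequence in PRIMERS.items():
    print(name, len(sequence), "nt")
```

Under these assumptions the program holds for 120 + 35 x 180 = 6420 seconds, i.e. 107 minutes before ramping is counted.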

The fragment generated by the above procedure is sequenced by standard techniques. 
Individuals heterozygous for the HBM gene will show an equal amount of G and T at the 
second position in the codon for glycine 171. Normal or homozygous wild-type individuals 
will show only G at this position. 
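
The sequencing read-out described above can be expressed as a simple genotype call. The sketch below is hypothetical: the function name, the use of peak heights as input and the 25% minor-signal threshold are assumptions for illustration, and the homozygous HBM call is an extrapolation, since the text describes only the heterozygous and wild-type patterns.

```python
def call_codon171_genotype(g_signal, t_signal, min_fraction=0.25):
    """Call the genotype from G and T signal at the second position of codon 171.

    g_signal and t_signal are peak heights from the sequencing trace.
    min_fraction is an assumed noise threshold, not a validated value.
    """
    total = g_signal + t_signal
    if total <= 0:
        raise ValueError("no signal at the queried position")
    t_fraction = t_signal / total
    if t_fraction < min_fraction:
        return "+/+"        # only G observed: wild-type
    if t_fraction > 1 - min_fraction:
        return "HBM/HBM"    # only T observed (extrapolated case)
    return "HBM/+"          # roughly equal G and T: heterozygous

print(call_codon171_genotype(1000, 980))
print(call_codon171_genotype(1500, 20))
```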

Other amplification techniques besides PCR may be used as alternatives, such as
ligation-mediated PCR or techniques involving Q-beta replicase (Cahill et al, Clin. Chem.,
37(9):1482-5 (1991)). For example, the oligonucleotides
AGCTGCTCGTAGCTGTCTCTCCCTGGATCACGGGTACATGTACTGGACAGACTGGGT (SEQ ID NO:34)
and
TGAGACGCCCCGGATTGAGCGGGCAGGGATAGCTTATTCCCTGTGCCGCATTACGGC (SEQ ID NO:35)
can be hybridized to a denatured human DNA
sample, treated with a DNA ligase, and then subjected to PCR amplification using the primer
oligonucleotides AGCTGCTCGTAGCTGTCTCTCCCTGGA (SEQ ID NO:36) and
GCCGTAATGCGGCACAGGGAATAAGCT (SEQ ID NO:37). In the first two
oligonucleotides, the outer 27 bases are random sequence corresponding to primer binding
sites, and the inner 30 bases correspond to sequences in the Zmax1 gene. The T at the end of
the first oligonucleotide corresponds to the HBM gene. The first two oligonucleotides are
ligated only when hybridized to human DNA carrying the HBM gene, which results in the
formation of an amplifiable 114 bp DNA fragment.
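
The relationship between the two ligation oligonucleotides and the two amplification primers can be checked with a short sketch. The sequences are transcribed from the text with internal spaces removed, which is an assumption about the original typesetting; the variable and function names are illustrative.

```python
def reverse_complement(seq):
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]

OLIGO_1 = "AGCTGCTCGTAGCTGTCTCTCCCTGGATCACGGGTACATGTACTGGACAGACTGGGT"  # SEQ ID NO:34
OLIGO_2 = "TGAGACGCCCCGGATTGAGCGGGCAGGGATAGCTTATTCCCTGTGCCGCATTACGGC"  # SEQ ID NO:35
PRIMER_FWD = "AGCTGCTCGTAGCTGTCTCTCCCTGGA"  # SEQ ID NO:36
PRIMER_REV = "GCCGTAATGCGGCACAGGGAATAAGCT"  # SEQ ID NO:37

# Ligation joins the 3' end of the first oligonucleotide to the 5' end of the second
ligated = OLIGO_1 + OLIGO_2

print(len(OLIGO_1), len(OLIGO_2), len(ligated))
print(ligated.startswith(PRIMER_FWD))                    # forward primer site
print(ligated.endswith(reverse_complement(PRIMER_REV)))  # reverse primer site
print(OLIGO_1[-1])                                       # allele-specific 3' base
```

Each oligonucleotide is 57 bases (27 primer-site bases plus 30 gene-derived bases), so the ligated product is 114 bases, in agreement with the fragment size stated above.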

Products of amplification can be detected by agarose gel electrophoresis, quantitative 
hybridization, or equivalent techniques for nucleic acid detection known to one skilled in the 
art of molecular biology (Sambrook et al, Molecular Cloning: A Laboratory Manual, Cold
Spring Harbor Laboratory, Cold Spring Harbor, NY (1989)).

Other alterations in the Zmax1 gene or the HBM gene may be diagnosed by the same
type of amplification-detection procedures, by using oligonucleotides designed to identify
those alterations. These procedures can be used in animals as well as humans to identify
alterations in Zmax1 or HBM that affect bone development.

Expression of Zmax1 or HBM in bone tissue may be accomplished by fusing the
cDNA of Zmax1 or HBM, respectively, to a bone-specific promoter in the context of a vector
for genetically engineering vertebrate cells. DNA constructs are introduced into cells by
packaging the DNA into virus capsids, by the use of cationic liposomes, electroporation, or
by calcium phosphate transfection. Transfected cells, preferably osteoblasts, may be studied
in culture or may be introduced into bone tissue in animals by direct injection into bone or by
intravenous injection of osteoblasts, followed by incorporation into bone tissue (Ko et al,
Cancer Research, 56(20):4614-9 (1996)). For example, the osteocalcin promoter, which is
specifically active in osteoblasts, may be used to direct transcription of the Zmax1 gene or the
HBM gene. Any of several vectors and transfection methods may be used, such as retroviral
vectors, adenovirus vectors, or vectors that are maintained after transfection using cationic
liposomes, or other methods and vectors described herein.

Alteration of the level of functional Zmax1 protein or HBM protein affects the level
of bone mineralization. By manipulating levels of functional Zmax1 protein or HBM protein,
it is possible to affect bone development and to increase or decrease levels of bone
mineralization. For example, it may be useful to increase bone mineralization in patients with
osteoporosis. Alternatively, it may be useful to decrease bone mineralization in patients with
osteopetrosis or Paget's disease. Alteration of Zmax1 levels or HBM levels can also be used
as a research tool. Specifically, it is possible to identify proteins, mRNA and other molecules
whose level or modification status is altered in response to changes in functional levels of
Zmax1 or HBM. The pathology and pathogenesis of bone disorders are known and described,
for example, in Rubin and Farber (Eds.), Pathology, 2nd Ed., J.B. Lippincott Co.,
Philadelphia, PA (1994).

A variety of techniques can be used to alter the levels of functional Zmax1 or HBM.
For example, intravenous or intraosseous injection of the extracellular portion of Zmax1 or
mutations thereof, or HBM or mutations thereof, will alter the level of Zmax1 activity or
HBM activity, respectively, in the body of the treated human, animal or bird. Truncated
versions of the Zmax1 protein or HBM protein can also be injected to alter the levels of
functional Zmax1 protein or HBM protein, respectively. Certain forms of Zmax1 or HBM
enhance the activity of endogenous protein, while other forms are inhibitory.

In a preferred embodiment, the HBM protein is used to treat osteoporosis. In a further 
preferred embodiment, the extracellular portion of the HBM protein is used. This HBM 
protein may be optionally modified by the addition of a moiety that causes the protein to 
adhere to the surface of cells. The protein is prepared in a pharmaceutically acceptable 
solution and is administered by injection or another method that achieves acceptable 
pharmacokinetics and distribution. 

In a second embodiment of this method, Zmax1 or HBM levels are increased or
decreased by gene therapy techniques. To increase Zmax1 or HBM levels, osteoblasts or
another useful cell type are genetically engineered to express high levels of Zmax1 or HBM
as described above. Alternatively, to decrease Zmax1 or HBM levels, antisense constructs
that specifically reduce the level of translatable Zmax1 or HBM mRNA can be used. In
general, a tissue-nonspecific promoter may be used, such as the CMV promoter or another
commercially available promoter found in expression vectors (Wu et al, Toxicol. Appl.
Pharmacol., 141(1):330-9 (1996)). In a preferred embodiment, a Zmax1 cDNA or its
antisense is transcribed by a bone-specific promoter, such as the osteocalcin or another
promoter, to achieve specific expression in bone tissue. In this way, if a Zmax1-expressing
DNA construct or HBM-expressing construct is introduced into non-bone tissue, it will not be
expressed.

In a third embodiment of this method, antibodies against Zmax1 or HBM are used to
inhibit its function. Such antibodies are identified herein.

In a fourth embodiment of this method, drugs that inhibit Zmax1 function or HBM
function are used. Such drugs are described herein and optimized according to techniques of
medicinal chemistry well known to one skilled in the art of pharmaceutical development.

Zmax1 and HBM interact with several proteins, such as ApoE. Molecules that inhibit
the interaction between Zmax1 or HBM and ApoE or another binding partner are expected to
alter bone development and mineralization. Such inhibitors may be useful as drugs in the
treatment of osteoporosis, osteopetrosis, or other diseases of bone mineralization. Such
inhibitors may be low molecular weight compounds, proteins or other types of molecules.
See Kim et al, J. Biochem. (Tokyo), 124(6):1072-1076 (1998).

Inhibitors of the interaction between Zmax1 or HBM and interacting proteins may be
isolated by standard drug-screening techniques. For example, Zmax1 protein (or a fragment
thereof) or HBM protein (or a fragment thereof) can be immobilized on a solid support such
as the base of a microtiter well. A second protein or protein fragment, such as ApoE, is
derivatized to aid in detection, for example with fluorescein, iodine, or biotin, then added to
the Zmax1 or HBM in the presence of candidate compounds that may specifically inhibit this
protein-protein domain of Zmax1 or HBM, respectively, and thus avoid problems associated
with its transmembrane segment. Drug screens of this type are well known to one skilled in
the art of pharmaceutical development.

Because Zmax1 and HBM are involved in bone development, proteins that bind to
Zmax1 and HBM are also expected to be involved in bone development. Such binding
proteins can be identified by standard methods, such as co-immunoprecipitation,
co-fractionation, or the two-hybrid screen (Ausubel et al, Current Protocols in Molecular
Biology, John Wiley & Sons (1997)). For example, to identify Zmax1-interacting proteins or
HBM-interacting proteins using the two-hybrid system, the extracellular domain of Zmax1 or
HBM is fused to LexA and expressed from the yeast vector pEG202 (the "bait") and expressed
in the yeast strain EGY48. The yeast strain is transformed with a "prey" library in the
appropriate vector, which encodes a galactose-inducible transcription-activation sequence
fused to candidate interacting proteins. The techniques for initially selecting and
subsequently verifying interacting proteins by this method are well known to one skilled in
the art of molecular biology (Ausubel et al, Current Protocols in Molecular Biology, John
Wiley & Sons (1997)).

In a preferred embodiment, proteins that interact with HBM, but not Zmax1, are
identified using a variation of the above procedure (Xu et al, Proc. Natl. Acad. Sci. USA,
94(23):12473-8 (Nov. 1997)). This variation of the two-hybrid system uses two baits, and
Zmax1 and HBM are fused to LexA and TetR, respectively. Alternatively, proteins that
interact with the HBM but not Zmax1 are also isolated. These procedures are well known to
one skilled in the art of molecular biology, and are a simple variation of standard two-hybrid
procedures.

As an alternative method of isolating Zmax1 or HBM interacting proteins, a
biochemical approach is used. The Zmax1 protein or a fragment thereof, such as the
extracellular domain, or the HBM protein or a fragment thereof, such as the extracellular
domain, is chemically coupled to Sepharose beads. The Zmax1- or HBM-coupled beads are
poured into a column. An extract of proteins, such as serum proteins, proteins in the
supernatant of a bone biopsy, or intracellular proteins from gently lysed TE85 osteoblastic
cells, is added to the column. Non-specifically bound proteins are eluted, the column is
washed several times with a low-salt buffer, and then tightly binding proteins are eluted with
a high-salt buffer. These are candidate proteins that bind to Zmax1 or HBM, and can be
tested for specific binding by standard tests and control experiments. Sepharose beads used
for coupling proteins and the methods for performing the coupling are commercially available
(Sigma), and the procedures described here are well known to one skilled in the art of protein
biochemistry.

As a variation of the above procedure, proteins that are eluted by high salt from the
Zmax1- or HBM-Sepharose column are then added to an HBM-Zmax1-Sepharose column.
Proteins that flow through without sticking are proteins that bind to Zmax1 but not to HBM.
Alternatively, proteins that bind to the HBM protein and not to the Zmax1 protein can be
isolated by reversing the order in which the columns are used.

XXI. Method of Use: Transformation-Associated Recombination (TAR) Cloning 

Essential for the identification of novel allelic variants of Zmax1 is the ability to
examine the sequence of both copies of the gene in an individual. To accomplish this, two
"hooks," or regions of significant similarity, are identified within the genomic sequence such
that they flank the portion of DNA that is to be cloned. Most preferably, the first of these
hooks is derived from sequences 5' to the first exon of interest and the second is derived from
sequences 3' to the last exon of interest. These two "hooks" are cloned into a bacterial/yeast
shuttle vector such as that described by Larionov et al, Proc. Natl. Acad. Sci. USA,
94:7384-7387 (1997). Other similar vector systems may also be used. To recover the entire genomic
copy of the Zmax1 gene, the plasmid containing the two "hooks" is linearized with a
restriction endonuclease or is produced by another method such as PCR. This linear DNA
fragment is introduced into yeast cells along with human genomic DNA. Typically, the yeast
Saccharomyces cerevisiae is used as a host cell, although Larionov et al (in press) have
reported using chicken host cells as well. During and after the process of transformation, the
endogenous host cell converts the linear plasmid to a circle by a recombination event
whereby the region of the human genomic DNA homologous to the "hooks" is inserted into
the plasmid. This plasmid can be recovered and analyzed by methods well known to one
skilled in the art. Obviously, the specificity for this reaction requires the host cell machinery
to recognize sequences similar to the "hooks" present in the linear fragment. However, 100%
sequence identity is not required, as shown by Kouprina et al, Genomics, 53(1):21-28
(October 1998), where the author describes using degenerate repeated sequences common in
the human genome to recover fragments of human DNA from a rodent/human hybrid cell
line.

In another example, only one "hook" is required, as described by Larionov et al, Proc.
Natl. Acad. Sci. USA, 95(8):4469-74 (April 1998). For this type of experiment, termed
"radial TAR cloning," the other region of sequence similarity to drive the recombination is
derived from a repeated sequence from the genome. In this way, regions of DNA adjacent to
the Zmax1 gene coding region can be recovered and examined for alterations that may affect
function.

XXII. Methods of Use: Genomic Screening 

The use of polymorphic genetic markers linked to the HBM gene or to Zmax1 is very
useful in predicting susceptibility to osteoporosis or other bone diseases. Koller et al, Amer.
J. Bone Min. Res., 13:1903-1908 (1998) have demonstrated that the use of polymorphic
genetic markers is useful for linkage analysis. Similarly, the identification of polymorphic
genetic markers within the high bone mass gene will allow the identification of specific
allelic variants that are in linkage disequilibrium with other genetic lesions that affect bone
development. Using the DNA sequence from the BACs, a dinucleotide (CA)n repeat was
identified and two unique PCR primers that will amplify the genomic DNA containing this
repeat were designed, as shown below:

B200E21C16_L: GAGAGGCTATATCCCTGGGC (SEQ ID NO:38) 
B200E21C16_R: ACAGCACGTGTTTAAAGGGG (SEQ ID NO:39) 

and used in the genetic mapping study. 
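
By way of illustration only, the two primers listed above can be characterized, and a (CA)n repeat located in a sequence, with a short script such as the following. This is a minimal sketch and not the procedure used in the mapping study: the function names, the minimum repeat length of five units, and the toy sequence are assumptions made for the example, and the melting temperature is the common rule-of-thumb estimate 2(A+T) + 4(G+C) rather than a value reported in this application.

```python
import re

# Primer sequences as given in the text (SEQ ID NO:38 and SEQ ID NO:39).
PRIMERS = {
    "B200E21C16_L": "GAGAGGCTATATCCCTGGGC",
    "B200E21C16_R": "ACAGCACGTGTTTAAAGGGG",
}


def gc_fraction(seq):
    """Fraction of bases in seq that are G or C."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def wallace_tm(seq):
    """Rule-of-thumb melting temperature in deg C: 2(A+T) + 4(G+C)."""
    seq = seq.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


def find_ca_repeats(seq, min_units=5):
    """Return (start, end, units) for each run of at least min_units CA units.

    Positions are 0-based and end is exclusive. min_units is an arbitrary
    illustrative cutoff, not a value taken from the application.
    """
    pattern = re.compile(r"(?:CA){%d,}" % min_units)
    return [
        (m.start(), m.end(), (m.end() - m.start()) // 2)
        for m in pattern.finditer(seq.upper())
    ]


# Made-up toy sequence for demonstration; it is NOT taken from the BACs.
TOY_SEQUENCE = "GGATC" + "CA" * 12 + "TTGAC"

if __name__ == "__main__":
    for name, seq in PRIMERS.items():
        print(name, len(seq), gc_fraction(seq), wallace_tm(seq))
    print(find_ca_repeats(TOY_SEQUENCE))
```

Both primers are 20 nucleotides long with estimated melting temperatures within a few degrees of each other, which is the usual requirement for a primer pair used in a single PCR reaction. 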

This method has been used successfully by others skilled in the art (e.g., Sheffield et 
al, Genet., 4:1837-1844 (1995); LeBlanc-Straceski et al, Genomics, 19:341-9 (1994); Chen et 
al, Genomics, 25:1-8 (1995)). Use of these reagents with populations or individuals will 
predict their risk for osteoporosis. Similarly, single nucleotide polymorphisms (SNPs), such 
as those shown in Table 4 above, can be used as well to predict risk for developing bone 
diseases or resistance to osteoporosis in the case of the HBM gene. 

XXIII. Methods of Use: Modulators of Tissue Calcification 

The calcification of tissues in the human body is well documented. Towler et al, J. 
Biol. Chem., 273:30427-34 (1998) demonstrated that several proteins known to regulate 
calcification of the developing skull in a model system are expressed in calcified aorta. The 
expression of Msx2, a gene transcribed in osteoprogenitor cells, in calcified vascular tissue 
indicates that genes which are important in bone development are involved in calcification of 
other tissues. Treatment with HBM protein, agonists or antagonists is likely to ameliorate 
calcification (such as of the vasculature, dentin and bone of the skull viscera) due to its 
demonstrated effect on bone mineral density. In experimental systems where tissue 
calcification is demonstrated, the over-expression or repression of Zmax1 activity permits the 
identification of molecules that are directly regulated by the Zmax1 gene. These genes are 
potential targets for therapeutics aimed at modulating tissue calcification. For example, an 
animal, such as the LDLR -/- mouse, is fed a high fat diet and is observed to demonstrate 
expression of markers of tissue calcification, including Zmax1. These animals are then 
treated with antibodies to Zmax1 or HBM protein, antisense oligonucleotides directed against 
Zmax1 or HBM cDNA, or with compounds known to bind the Zmax1 or HBM protein or its 
binding partner or ligand. RNA or proteins are extracted from the vascular tissue and the 
relative expression levels of the genes expressed in the tissue are determined by methods well 
known in the art. Genes that are regulated in the tissue are potential therapeutic targets for 
pharmaceutical development as modulators of tissue calcification. 
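
The comparison of relative expression levels described above can be sketched as follows. This is a hedged illustration only: the gene names and expression values are invented placeholders, and the pseudocount and the two-fold cutoff are assumptions chosen for the example rather than parameters taken from this application.

```python
import math


def log2_fold_changes(treated, control, pseudocount=1.0):
    """Log2 ratio of treated to control expression for genes present in both.

    A pseudocount is added to both values so that genes with zero measured
    expression do not cause a division by zero.
    """
    shared = sorted(set(treated) & set(control))
    return {
        gene: math.log2((treated[gene] + pseudocount) / (control[gene] + pseudocount))
        for gene in shared
    }


def regulated_genes(fold_changes, threshold=1.0):
    """Genes whose absolute log2 fold change meets the threshold.

    A threshold of 1.0 corresponds to a two-fold change in either direction.
    """
    return sorted(g for g, fc in fold_changes.items() if abs(fc) >= threshold)


# Invented placeholder measurements (arbitrary units); not experimental data.
CONTROL = {"geneA": 99.0, "geneB": 49.0, "geneC": 199.0}
TREATED = {"geneA": 399.0, "geneB": 51.0, "geneC": 49.0}

if __name__ == "__main__":
    changes = log2_fold_changes(TREATED, CONTROL)
    for gene, value in changes.items():
        print(gene, round(value, 3))
    print(regulated_genes(changes))
```

In this toy data set geneA is induced four-fold and geneC is repressed four-fold by treatment, so both would be flagged as candidate targets, while geneB is essentially unchanged. 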

The nucleic acids, proteins, peptides, amino acids, small molecules or other 
pharmaceutically useful compounds of the present invention that are to be given to an 
individual may be administered in the form of a composition with a pharmaceutically 
acceptable carrier, excipient or diluent, which are well known in the art. The individual may 
be a mammal or a bird, preferably a human, a rat, a mouse or bird. Such compositions may 
be administered to an individual in a pharmaceutically effective amount. The amount 
administered will vary depending on the condition being treated and the patient being treated. 
The compositions may be administered alone or in combination with other treatments. 

EXAMPLES 

The present invention is described by reference to the following Examples, which are 
offered by way of illustration and are not intended to limit the invention in any manner. 
Standard techniques well known in the art or the techniques specifically described below 
were utilized. 
Example 1 

The propositus was referred by her physicians to the Creighton Osteoporosis Center 
for evaluation of what appeared to be unusually dense bones. She was 18 years old and had come 
to medical attention two years previously because of back pain, which was precipitated by an 
auto accident in which the car in which she was riding as a passenger was struck from behind. 
Her only injury was soft tissue injury to her lower back that was manifested by pain and 
muscle tenderness. There was no evidence of fracture or subluxation on radiographs. The 
pain lasted for two years, although she was able to attend school full time. By the time she 
was seen in the Center, the pain was nearly resolved and she was back to her usual activities 
as a high school student. Physical exam revealed a normal healthy young woman standing 66 
inches and weighing 128 pounds. Radiographs of the entire skeleton revealed dense looking 
bones with thick cortices. All bones of the skeleton were involved. Most importantly, the 
shapes of all the bones were entirely normal. The spinal BMC was 94.48 grams in L1-4, and 
the spinal BMD was 1.667 gm/cm² in L1-4. BMD was 5.62 standard deviations (SD) above 
peak skeletal mass for women. These were measured by DXA using a Hologic 2000. Her 
mother was then scanned and a lumbar spinal BMC of 58.05 grams and BMD of 1.500 
gm/cm² were found. Her mother's values place her 4.12 SD above peak mass and 4.98 SD 
above her peers. Her mother was 51 years old, stood 65 inches and weighed 140 pounds. 
Her mother was in excellent health with no history of musculoskeletal or other symptoms. 
Her father's lumbar BMC was 75.33 grams and his BMD was 1.118 gm/cm². These values 
place him 0.25 SD above peak bone mass for males. He was in good health, stood 72 inches 
tall, and weighed 187 pounds. 
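
The relationship between a measured BMD and its reported number of standard deviations above peak mass is linear (BMD = reference mean + number of SD x reference SD). As a worked illustration, the reference mean and SD implied by the two female measurements above can be back-calculated. This sketch assumes that the daughter's and the mother's values were both scored against the same female peak-mass reference; the reference values themselves are not stated in this application, so the results are estimates derived from the reported figures and nothing more.

```python
def implied_reference(bmd_a, n_sd_a, bmd_b, n_sd_b):
    """Solve bmd = mean + n_sd * sd for (mean, sd) from two observations."""
    if n_sd_a == n_sd_b:
        raise ValueError("the two observations must differ in their SD scores")
    sd = (bmd_a - bmd_b) / (n_sd_a - n_sd_b)
    mean = bmd_a - n_sd_a * sd
    return mean, sd


def sd_score(bmd, mean, sd):
    """Number of standard deviations by which bmd exceeds the reference mean."""
    return (bmd - mean) / sd


# Values reported in Example 1 (gm/cm^2 and SD above peak mass for women).
PROPOSITUS = (1.667, 5.62)
MOTHER = (1.500, 4.12)

if __name__ == "__main__":
    mean, sd = implied_reference(*PROPOSITUS, *MOTHER)
    print("implied reference mean:", round(mean, 4))
    print("implied reference SD:", round(sd, 4))
    print("propositus score check:", round(sd_score(PROPOSITUS[0], mean, sd), 2))
```

Under the stated assumption, the two reported scores are mutually consistent with a reference mean of roughly 1.04 gm/cm² and a reference SD of roughly 0.11 gm/cm². The father's value is not included because it was scored against a male reference. 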

These clinical data suggested that the propositus inherited a trait from her mother, 
which resulted in very high bone mass, but an otherwise normal skeleton, and attention was 
focused on the maternal kindred. In U.S. Patent No. 5,691,153, twenty-two of these 
members had measurement of bone mass by DXA. In one case, the maternal grandfather of 
the propositus was deceased; however, medical records, antemortem skeletal radiographs and 
a gall bladder specimen embedded in paraffin for DNA genotyping were obtained. His 
radiographs showed obvious extreme density of all of the bones available for examination, 
including the femur and the spine, and he was included among the affected members. In this 
invention, the pedigree has been expanded to include 37 informative individuals. These 
additions are a significant improvement over the original kinship (Johnson et al, Am. J. Hum. 
Genet., 60:1326-1332 (1997)) because, among the fourteen individuals added since the 
original study, two individuals harbor key crossovers. X-linkage is ruled out by the presence 
of male-to-male transmission from individual 12 to 14 and 15. 
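
The reasoning behind the exclusion of X-linkage is that a father passes his Y chromosome, not his X chromosome, to each son, so an affected father with affected sons points away from an X-linked trait. The check can be sketched as below. Only the three individuals named in the preceding sentence are represented; the data structure and function names are hypothetical, and the sketch deliberately ignores the alternative that a son inherited the trait from an affected mother.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Person:
    id: int
    sex: str                      # "M" or "F"
    affected: bool
    father: Optional[int] = None  # id of the father, if recorded


def male_to_male_transmissions(people: List[Person]) -> List[Tuple[int, int]]:
    """Return (father_id, son_id) pairs where both father and son are affected."""
    by_id = {p.id: p for p in people}
    pairs = []
    for child in people:
        if child.father is None:
            continue
        dad = by_id.get(child.father)
        if dad is None:
            continue
        if dad.sex == "M" and dad.affected and child.sex == "M" and child.affected:
            pairs.append((dad.id, child.id))
    return sorted(pairs)


def x_linkage_excluded(people: List[Person]) -> bool:
    """True if at least one affected father has an affected son."""
    return len(male_to_male_transmissions(people)) > 0


# Individuals 12, 14 and 15 as described in the text; the rest of the
# 37-member pedigree is not reproduced here.
FAMILY = [
    Person(id=12, sex="M", affected=True),
    Person(id=14, sex="M", affected=True, father=12),
    Person(id=15, sex="M", affected=True, father=12),
]

if __name__ == "__main__":
    print(male_to_male_transmissions(FAMILY))
    print(x_linkage_excluded(FAMILY))
```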
Example 2 

The present invention describes DNA sequences derived from two BAC clones from 
the HBM gene region, as evident in Table 6 below, which is an assembly of these clones. 
Clone b200e21-h (ATCC No. 980812; SEQ ID NOS: 10-11) was deposited at the American 
Type Culture Collection (ATCC), 10801 University Blvd., Manassas, VA 20110-2209 
U.S.A., on December 30, 1997. Clone b527d12-h (ATCC No. 980720; SEQ ID NOS: 5-9) 
was deposited at the American Type Culture Collection (ATCC), 10801 University Blvd., 
Manassas, VA 20110-2209 U.S.A., on October 2, 1998. These sequences are unique reagents 
that can be used by one skilled in the art to identify DNA probes for the Zmax1 gene, PCR 
primers to amplify the gene, nucleotide polymorphisms in the Zmax1 gene, or regulatory 
elements of the Zmax1 gene. 

TABLE 6 

Contig                   ATCC No.    SEQ ID NO.    Length (base pairs) 
b527d12-h_contig302G     980720      5             3096 
b527d12-h_contig306G     980720      6             26928 
b527d12-h_contig307G     980720      7             29430 
b527d12-h_contig308G     980720      8             33769 
b527d12-h_contig309G     980720      9             72049 
b200e21-h_contig1        980812      10            8705 
b200e21-h_contig4        980812      11            66933 



The disclosure of each of the patents, patent applications and publications cited in the 
specification is hereby incorporated by reference herein in its entirety. 

Although the invention has been set forth in detail, one skilled in the art will recognize 
that numerous changes and modifications can be made, and that such changes and 
modifications may be made without departing from the spirit and scope of the invention. 

CLAIMS 

What is claimed is: 

1. An isolated amino acid sequence of SEQ ID NO: 4. 

2. A method of identifying a molecule involved in bone modulation comprising 
identifying a molecule that binds to, or that inhibits binding of a molecule to, HBM. 

3. The method of claim 2, wherein said molecule is a protein. 

4. A method for identifying a protein involved in bone modulation comprising 
identifying a protein that has an expression level that is different in a first host comprising the 
Zmax1 gene when compared to a second host comprising the HBM gene. 

5. The method of claim 4, wherein the host is a cell or an animal. 



6. A method of identifying a candidate protein involved in bone modulation 
comprising 

identifying a protein in a first individual having the high bone mass phenotype; 

identifying a protein in a second individual not having the high bone mass phenotype; 

comparing the protein of the first individual to the protein of the second individual, 
wherein (i) the protein that is present in the first individual but not the second individual is 
the candidate protein or (ii) the protein that is present in a higher amount in the first 
individual than in the second individual is the candidate protein or (iii) the protein that is 
present in a lower amount in the first individual than in the second individual is the candidate 
protein. 

7. The method of claim 6, further comprising producing an antibody to the 
candidate protein. 

8. A method of identifying a candidate protein involved in bone modulation 
comprising 

identifying a protein in a first individual having the high bone mass phenotype; 

identifying a protein in a second individual not having the high bone mass phenotype; 

comparing the protein of the first individual to the protein of the second individual, 
wherein (i) the protein that is present in the second individual but not the first individual is 
the candidate protein or (ii) the protein that is present in a higher amount in the second 
individual than in the first individual is the candidate protein or (iii) the protein that is present 
in a lower amount in the second individual than in the first individual is the candidate protein. 

9. The method of claim 8, further comprising producing an antibody to the 
candidate protein. 

10. A method of testing for HBM activity comprising immobilizing an HBM 
protein, binding a protein to the HBM protein, and measuring the extent of binding. 

11. The method of claim 10, wherein the protein is ApoE. 

12. A method of pharmaceutical development for treatment of bone development 
disorders comprising identifying a molecule that binds to the amino acid sequence of SEQ ID 
NO: 4. 

13. The method of claim 12, wherein the molecule inhibits or enhances the 
function of the amino acid. 

14. A method of altering bone development in a host comprising administering the 
amino acid sequence of claim 1 to a somatic cell of a host suffering from a bone development 
disorder. 

15. The method of claim 14, wherein the host is a human or another vertebrate. 

16. A method of altering bone development in a host comprising administering the 
amino acid sequence of claim 1 to a germ-line cell in a host suffering from a bone 
development disorder. 

17. The method of claim 16, wherein the animal is a human or another vertebrate. 

18. A method of treating osteoporosis comprising administering the amino acid 
sequence of claim 1 to a patient in need thereof. 

19. The method of claim 18, wherein the patient is a human or another vertebrate. 

20. A method of treating osteoporosis comprising administering the extracellular 
domain of the amino acid sequence of claim 1 to a patient in need thereof. 

21. The method of claim 20, wherein the patient is a human or another vertebrate. 

22. A method of treating osteoporosis comprising administering the intracellular 
domain of the amino acid sequence of claim 1 to a patient in need thereof. 

23. The method of claim 22, wherein the patient is a human or another vertebrate. 

24. A method for treating bone development disorders comprising administering 
an antibody to a patient in need thereof, wherein the antibody is to the amino acid sequence of 
claim 1. 

25. A diagnostic assay for bone development disorders comprising an antibody to 
the HBM protein. 






Atty. Dkt. No.: 032796-014 

ABSTRACT 

The present invention relates to methods and materials used to isolate and detect a 
high bone mass gene and a corresponding wild-type gene, and mutants thereof. The present 
invention also relates to the high bone mass gene, the corresponding wild-type gene, and 
mutants thereof. The genes identified in the present invention are implicated in bone 
development. The invention also provides nucleic acids, including coding sequences, 
oligonucleotide primers and probes, proteins, cloning vectors, expression vectors, 
transformed hosts, methods of developing pharmaceutical compositions, methods of 
identifying molecules involved in bone development, and methods of diagnosing and treating 
diseases involved in bone development. In preferred embodiments, the present invention is 
directed to methods for treating, diagnosing and preventing osteoporosis. 

Exon 1 

ACTAAAGCGCCGCCGCCGCGCCATGGAGCCCGAGTGAGCGCGGCGCGG 
GCCCGTCCGGCCGCCGGACAACATGGAGGCAGCGCCGCCCGGGCCGCC 
GTGGCCGCTGCTGCTGCTGCTGCTGCTGCTGCTGGCGCTGTGCGGCTGC 

CCGGCCCCCGCCGCGGCC 

Exon 2 Coordinates: 527d12_Contig308G 30944-30549 

gccccacagCCTCGCCGCTCCTGCTATTTGCCAACCGCCGGGACGTACGGCTG 
GTGGACGCCGGCGGAGTCAAGCTGGAGTCCACCATCGTGGTCAGCGGCC 
TGGAGGATGCGGCCGCAGTGGACTTCCAGTTTTCCAAGGGAGCCGTGTA 
CTGGACAGACGTGAGCGAGGAGGCCATCAAGCAGACCTACCTGAACCAG 
ACGGGGGCCGCCGTGCAGAACGTGGTCATCTCCGGCCTGGTCTCTCCCG 
ACGGCCTCGCCTGCGACTGGGTGGGCAAGAAGCTGTACTGGACGGACTC 
AGAGACCAACCGCATCGAGGTGGCCAACCTCAATGGCACATCCCGGAAG 
GTGCTCTTCTGGCAGGACCTTGACCAGCCGAGGGCCATCGCCTTGGACC 

CCGCTCACGGgtaaaccctgctg 
... 9408 nt ... 

Exon 3 Coordinates: 527d12_Contig308G 21141-20945 

ccccgtcacagGTACATGTACTGGACAGACTGGGGTGAGACGCCCCGGATTGA 
GCGGGCAGGGATGGATGGCAGCACCCGGAAGATCATTGTGGACTCGGAC 
ATTTACTGGCCCAATGGACTGACCATCGACCTGGAGGAGCAGAAGCTCT 
ACTGGGCTGACGCCAAGCTCAGCTTCATCCACCGTGCCAACCTGGACGG 

CTCGTTCCGgtaggtacccac 
... 6094 nt... 

Exon 4 Coordinates: 527d12_Contig308G 15047-14850 

tccctgactgcagGCAGAAGGTGGTGGAGGGCAGCCTGACGCACCCCTTCGCCC 
TGACGCTCTCCGGGGACACTCTGTACTGGACAGACTGGCAGACCCGCTC 
CATCCATGCCTGCAACAAGCGCACTGGGGGGAAGAGGAAGGAGATCCTG 
AGTGCCCTATACTCACCCATGGACATCCAGGTGCTGAGCCAGGAGCGGC 

AGCCTTTCTgtgagtgccgg 
... 1827 nt... 

Exon 5 Coordinates: 527d12_Contig308G 13220-13088 

tttctcagTCCACACTCGCTGTGAGGAGGACAATGGCGGCTGCTCCCACCTGT 
GCCTGCTGTCCCCAAGCGAGCCTTTCTACACATGCGCCTGCCCCACGGG 
TGTGCAGCTGCAGGACAACGGCAGGACGTGTAAGGCAGgtgaggcggtgggacg 
... 20923 nt ... 

Exon 6 Coordinates: 527d12_Contig309G 7705-8100 

ctccacagGAGCCGAGGAGGTGCTGCTGCTGGCCCGGCGGACGGACCTACGG 

AGGATCTCGCTGGACACGCCGGACTTCACCGACATCGTGCTGCAGGTGG 

ACGACATCCGGCACGCCATTGCCATCGACTACGACCCGCTAGAGGGCTA 

TGTCTACTGGACAGATGACGAGGTGCGGGCCATCCGCAGGGCGTACCTG 

GACGGGTCTGGGGCGCAGACGCTGGTCAACACCGAGATCAACGACCCCG 

ATGGCATCGCGGTCGACTGGGTGGCCCGAAACCTCTACTGGACCGACAC 

GGGCACGGACCGCATCGAGGTGACGCGCCTCAACGGCACCTCCCGCAAG 

ATCCTGGTGTCGGAGGACCTGGACGAGCCCCGAGCCATCGCACTGCACC 

CCGTGATGGGgtaagacgggc 
3211 nt 

Exon 7 Coordinates: 527d12_Contig309G 11311-11482 

ttcttctccagCCTCATGTACTGGACAGACTGGGGAGAGAACCCTAAAATCGAGT 
GTGCCAACTTGGATGGGCAGGAGCGGCGTGTGCTGGTCAATGCCTCCCT 
CGGGTGGCCCAACGGCCTGGCCCTGGACCTGCAGGAGGGGAAGCTCTAC 
TGGGGAGACGCCAAGACAGACAAGATCGAGgtgaggctcctgtgg 



13445 nt..... 

Exon 8 Coordinates: 527d12_Contig309G 24927-25143 

ccgtcctgcagGTGATCAATGTTGATGGGACGAAGAGGCGGACCCTCCTGGAG 

GACAAGCTCCCGCACATTTTCGGGTTCACGCTGCTGGGGGACTTCATCTA 

CTGGACTGACTGGCAGCGCCGCAGCATCGAGCGGGTGCACAAGGTCAAG 

GCCAGCCGGGACGTCATCATTGACCAGCTGCCCGACCTGATGGGGCTCA 

AAGCTGTGAATGTGGCCAAGGTCGTCGgtgagtccggggggtc 



....2826 nt 

Exon 9 Coordinates: 527d12_Contig309G 27969-28256 

gttcgcttccagGAACCAACCCGTGTGCGGACAGGAACGGGGGGTGCAGCCACC 

TGTGCTTCTTCACACCCCACGCAACCCGGTGTGGCTGCCCCATCGGCCT 

GGAGCTGCTGAGTGACATGAAGACCTGCATCGTGCCTGAGGCCTTCTTG 

GTCTTCACCAGCAGAGCCGCCATCCACAGGATCTCCCTCGAGACCAATA 

ACAACGACGTGGCCATCCCGCTCACGGGCGTCAAGGAGGCCTCAGCCCT 

GGACTTTGATGTGTCCAACAACCACATCTACTGGACAGACGTCAGCCTGA 

AGgtagcgtgggc 
3102 

FIGURE 3B 



Exon 10 Coordinates: 527d12_Contig309G 31358-31582 

cctgctgccagACCATCAGCCGCGCCTTCATGAACGGGAGCTCGGTGGAGCAC 

GTGGTGGAGTTTGGCCTTGACTACCCCGAGGGCATGGCCGTTGACTGGA 

TGGGCAAGAACCTCTACTGGGCCGACACTGGGACCAACAGAATCGAAGT 

GGCGCGGCTGGACGGGCAGTTCCGGCAAGTCCTCGTGTGGAGGGACTTG 

GACAACCCGAGGTCGCTGGCCCTGGATCCCACCAAGGGgtaagtgtttgcctgtc 



1297 nt 

Exon 11 Coordinates: 527d12_Contig309G 32879-33064 

gtgccttccagCTACATCTACTGGACCGAGTGGGGCGGCAAGCCGAGGATCGTG 
CGGGCCTTCATGGACGGGACCAACTGCATGACGCTGGTGGACAAGGTGG 
GCCGGGCCAACGACCTCACCATTGACTACGCTGACCAGCGCCTCTACTG 
GACCGACCTGGACACCAACATGATCGAGTCGTCCAACATGCTGGgtgagggcc 

gggct 

2069 nt 

Exon 12 Coordinates: 527d12_Contig309G 35133-35454 

gtgttcatgcagGTCAGGAGCGGGTCGTGATTGCCGACGATCTCCCGCACCCGT 

TCGGTCTGACGCAGTACAGCGATTATATCTACTGGACAGACTGGAATCTG 

CACAGCATTGAGCGGGCCGACAAGACTAGCGGCCGGAACCGCACCCTCA 

TCCAGGGCCACCTGGACTTCGTGATGGACATCCTGGTGTTCCACTCCTCC 

CGCCAGGATGGCCTCAATGACTGTATGCACAACAACGGGCAGTGTGGGC 

AGCTGTGCCTTGCCATCCCCGGCGGCCACCGCTGCGGCTGCGCCTCACA 

CTACACCCTGGACCCCAGCAGCCGCAACTGCAGCCgtaagtgcctcatggt 



2006 nt 

Exon 13 Coordinates: 527d12_Contig309G 37460-37659 

gcctcctctaCGCCCACCACCTTCTTGCTGTTCAGCCAGAAATCTGCCATCAGT 
CGGATGATCCCGGACGACCAGCACAGCCCGGATCTCATCCTGCCCCTGC 
ATGGACTGAGGAACGTCAAAGCCATCGACTATGACCCACTGGACAAGTT 
CATCTACTGGGTGGATGGGCGCCAGAACATCAAGCGAGCCAAGGACGAC 

GGGACCCAGgcaggtgccctgtgg 
6965 nt 

Exon 14 Coordinates: 527d12_Contig309G 44624-44832 

ctttgtcttacagCCCTTTGTTTTGACCTCTCTGAGCCAAGGCCAAAACCCAGACA 

GGCAGCCCCACGACCTCAGCATCGACATCTACAGCCGGACACTGTTCTG 

GACGTGCGAGGCCACCAATACCATCAACGTCCACAGGCTGAGCGGGGAA 

GCCATGGGGGTGGTGCTGCGTGGGGACCGCGACAAGCCCAGGGCCATC 

GTCGTCAACGCGGAGCGAGGgtaggaggccaac 

1404 nt 

Exon 15 Coordinates: 527d12_Contig309G 46236-46427 

ccaccctcccgcagGTACCTGTACTTCACCAACATGCAGGACCGGGCAGCCAAGA 
TCGAACGCGCAGCCCTGGACGGCACCGAGCGCGAGGTCCTCTTCACCAC 
CGGCCTCATCCGCCCTGTGGCCCTGGTGGTGGACAACACACTGGGCAAG 
CTGTTCTGGGTGGACGCGGACCTGAAGCGCATTGAGAGCTGTGACCTGT 

CAGgtacgcgccccgg 
686 nt 

Exon 16 Coordinates: 527d12_Contig309G 47113-47322 

ggctgcttgcagGGGCCAACCGCCTGACCCTGGAGGACGCCAACATCGTGCAGC 

CTCTGGGCCTGACCATCCTTGGCAAGCATCTCTACTGGATCGACCGCCA 

GCAGCAGATGATCGAGCGTGTGGAGAAGACCACCGGGGACAAGCGGACT 

CGCATCCAGGGCCGTGTCGCCCACCTCACTGGCATCCATGCAGTGGAGG 

AAGTCAGCCTGGAGGAGTTCTgtacgtgggggc 

3884nt 

Exon 17 Coordinates: 527d12_Contig309G 51206-51331 

ttgtctttgcagCAGCCCACCCATGTGCCCGTGACAATGGTGGCTGCTCCCACAT 
CTGTATTGCCAAGGGTGATGGGACACCACGGTGCTCATGCCCAGTCCAC 
CTCGTGCTCCTGCAGAACCTGCTGACCTGTGGAGgtaggtgtgacctaggtgc 

....3905 nt 

Exon 18 Coordinates: 527d12_Contig309G 55236-55472 

gttctcctctgtccctcccccagAGCCGCCCACCTGCTCCCCGGACCAGTTTGCATGTGC 

CACAGGGGAGATCGACTGTATCCCCGGGGCCTGGCGCTGTGACGGCTTT 

CCCGAGTGCGATGACCAGAGCGACGAGGAGGGCTGCCCCGTGTGCTCCG 

CCGCCCAGTTCCCCTGCGCGCGGGGTCAGTGTGTGGACCTGCGCCTGCG 

CTGCGACGGCGAGGCAGACTGTCAGGACCGCTCAGACGAGGTGGACTGT 

GACGgtgaggccctcc 
3052 nt 

FIGURE 3D 



Exon 19 Coordinates: 527d12_Contig309G 58524-58634 

tctccttgcagCCATCTGCCTGCCCAACCAGTTCCGGTGTGCGAGCGGCCAGTG 

TGTCCTCATCAAACAGCAGTGCGACTCCTTCCCCGACTGTATCGACGGCT 

CCGACGAGCTCATGTGTGgtgagccagctt 

1448 nt 



Exon 20 Coordinates: 527d12_Contig309G 60082-60319 

gtttgtctctggcagAAATCACCAAGCCGCCCTCAGACGACAGCCCGGCCCACAGC 

AGTGCCATCGGGCCCGTCATTGGCATCATCCTCTCTCTCTTCGTCATGGG 

TGGTGTCTATTTTGTGTGCCAGCGCGTGGTGTGCCAGCGCTATGCGGGG 

GCCAACGGGCCCTTCCCGCACGAGTATGTCAGCGGGACCCCGCACGTGC 

CCCTCAATTTCATAGCCCCGGGCGGTTCCCAGCATGGCCCCTTCACAGgta 

aggagcctgagatatggaa 



....1095 nt 

Exon 21 Coordinates: 527d12_Contig309G 61414-61552 

cttccctgccagGCATCGCATGCGGAAAGTCCATGATGAGCTCCGTGAGCCTGA 
TGGGGGGCCGGGGCGGGGTGCCCCTCTACGACCGGAACCACGTCACAG 
GGGCCTCGTCCAGCAGCTCGTCCAGCACGAAGGCCACGCTGTACCCGCC 

Ggtgaggggcggg 



6513 nt 

Exon 22 Coordinates: 527d12_Contig309G 68065-68162 

ttggctctcctcagATCCTGAACCCGCCGCCCTCCCCGGCCACGGACCCCTCCCTG 
TACAACATGGACATGTTCTACTCTTCAAACATTCCGGCCACTGCGAGACC 

GTACAGgtaggacatcccctgcag 



2273 nt 

Exon 23 Coordinates: 527d12_Contig309G 70435-70901 

tcaaacattccggccactgcgagaccgtacagGCCCTACATCATTCGAGGAATGGCGCCCCC 

GACGACGCCCTGCAGCACCGACGTGTGTGACAGCGACTACAGCGCCAGC 

CGCTGGAAGGCCAGCAAGTACTACCTGGATTTGAACTCGGACTCAGACC 

CCTATCCACCCCCACCCACGCCCCACAGCCAGTACCTGTCGGCGGAGGA 

CAGCTGCCCGCCCTCGCCCGCCACCGAGAGGAGCTACTTCCATCTCTTC 

CCGCCCCCTCCGTCCCCCTGCACGGACTCATCCTGACCTCGGCCGGGCCA 

CTCTGGCTTCTCTGTGCCCCTGTAAATAGTTTTAAATATGAACAAAGAAAAAA 

ATATATTTTATGATTTAAAAAATAAATATAATTGGGATTTTAAAAACATGAGA 

AATGTGAACTGTGATGGGGTGGGCAGGGCTGGGAGAACTTTGTACAGTGGAG 

AAATATTTATAAACTTAATTTTGTAAAACA 

1 ACTAAAGCGCGGCCGCCGCGCCATGGAGCCCGAGTGAGCGCGGCGCGGGCCCGTCCGGCC 60 
61 GCCGGACAACATGGAGGCAGCGCCGCCCGGGCCGCCGTGGCCGCTGCTGCTGCTGCTGCT 120 
1 MEAAPPGPPWPLLLLLL 17 



121 GCTGCTGCTGGCGCTGTGCGGCTGCCCGGCCCCCGCCGCGGCCTCGCCGCTCCTGCTATT 180 
18 LLLALCGCPAPAAASPLLLF 37 



181 TGCCAACCGCCGGGACGTACGGCTGGTGGACGCCGGCGGAGTCAAGCTGGAGTCCACCAT 240 

38 ANRRDVRLVDAGGVKLESTI 57 

241 CGTGGTCAGCGGCCTGGAGGATGCGGCCGCAGTGGACTTCCAGTTTTCCAAGGGAGCCGT 3 0 0 

58 .VVS GL.EDAAAVD F Q F S KGAV 77 

3 01 GTACTGGACAGACGTGAGCGAGGAGGCCATCAAGCAGAC CTACCTGAAC CAGACGGGGGC 3 60 

78 YWTDV SEEAIKQTYLNQTGA 97 

3 61 CGCCGTGCAGAACGTGGTCATCTCCGGCCTGGTCTCTCCCGACGGCCTCGCCTGCGACTG 42 0 
98 AVQNVVISGLVSPDGLACDW 117 

421 GGTGGGCAAGAAGCTGTACTGGACGGACTCAGAGAGCAACCGCATCGAGGTGGCCAACCT 48 0 

118 VGKKIiYWTD S E T IT R I EVANL 137 

4 81 CAATGGCACATCCCGGAAGGTGCTCTTCTGGCAGGACCTTGACCAGCCGAGGGCCATCGC 540 
138 N.GTSRKVLFWQDLDQPRAIA 157 

541 CTTGGACC CCGCTC ACGGGTAC ATGTACTGGACAGACTGGGGTGAGACGC C C CGGATTGA 600 

158 LDPAHGY MYWTDWGETPRIE 177 

601 GCGGGCAGGGATGGATGGCAGCAC CCGGAAGATCATTGTGGACTCGGACATTTACTGGC C 660 

178 RAGMDGSTRKI IVDSD IYWP 197 

661 CAATGGACTGAC CATCGACCTGGAGGAGC AGAAGCTCTACTGGGCTGACGC CAAGCTCAG 720 

198 NGLTIDLEEQKLYWADAKLS 217 

721 CTTCATCCACCGTGCCAACCTGGACGGCTCGTTCCGGCAGAAGGTGGTGGAGGGCAGCCT 7 8 0 

218 F-IHRANLDGSFRQKVVEGSL 237 

7 81 GACGCACCCCTTCGCCCTGACGCTCTCCGGGGACACTCTGTACTGGACAGACTGGCAGAC 840 

238 THPFALTLSGDTLYWTDWQT 257 

841 C CGCTGC ATCCATGCCTGCAACAAGCGCACTGGGGGGAAGAGGAAGGAGATC CTGAGTGC 900 

258 RSIHACNKRTGGKRKEILSA 277 

901 CCTCTACTCACCCATGGACATCCAGGTGCTGAGCCAGGAGCGGCAGCCTTTCTTCCACAC 960 

278 LYSPMD IQVLS QERQPFFHT 297 

961 TCGCTGTGAGGAGGACAATGGCGGCTGCTCCCACCTGTGCCTGCTGTCCCCAAGCGAGCC 1020 

298 RCEEDNGGCSHLCLLSPSEP 317 

1021 TTTCTACACATGCGCCTGCCCCACGGGTGTGCAGCTGCAGGACAACGGCAGGACGTGTAA 10 80 

318 FYTCACPTG VQLQD UGRTCK 337 

1081 GGCAGGAGCCGAGGAGGTGCTGCTGCTGGCCCGGCGGACGGACCTACGGAGGATCTCGCT 1140 

338 AGAEEVLLLARRTDLRRISL 357 

1141 GGACACGCCGGACTTCACCGACATCGTGCTGCAGGTGGACGACATCCGGCACGCCATTGC 1200 

358 DTPDFTDIVLQVDDIRHAIA 377 

12 01 CATCGACTACGACCCGCTAGAGGGCTATGTCTACTGGACAGATGACGAGGTGCGGGCCAT 12 60 

378 IDYDPLEGYVYWT DDEVRAI 397 

12 61 CCGCAGGGCGTACCTGGACGGGTCTGGGGCGCAGACGCTGGTCAACACCGAGATCAACGA 13 2 0 
398 RRAYLDGSGAQTLVNTSIN D 417 

13 21 CCCCGATGGCATCGCGGTCGACTGGGTGGCCCGAAACCTCTACTGGACCGACACGGGCAC 13 80 
418 PDGIAVDWVARNLYWTDTGT 437 

13 81 GGACCGCATCGAGGTGACGCGCCTCAACGGCACCTCCCGCAAGATCCTGGTGTCGGAGGA 144 0 

438 DRIEVTRLtfGTSRKILVSED 457 

1441 CCTGGACGAGCCCCGAGCCATGGCACTGCACCCCGTGATGGGCCTCATGTACTGGACAGA 1500 

458 LDEPRAIAL HPVMGLMYWTD 477 

15 01 CTGGGGAGAGAACCCTAAAATCGAGTGTGCCAACTTGGATGGGCAGGAGCGGCGTGTGCT 15 60 
478 WGEtfPKIECANLDGQERRVL 497 

1561 GGTCAATGCCTCCCTCGGGTGGCCCAACGGCCTGGCCCTGGACCTGCAGGAGGGGAAGCT 162 0 

498 VNASLGWPNGLALDLQEGKL 517 

1621 CTACTGGGGAGACGCCAAGACAGACAAGATCGAGGTGATCAATGTTGATGGGACGAAGAG 168 0 

518 YWGDAKTDKIEVINVDGTKR 537 

16 81 GCGGACCCTCCTGGAGGACAAGCTCCCGCACATTTTCGGGTTCACGCTGCTGGGGGACTT 174 0 
538 RTLLEDKLPHIFGFTLLGDF 557 

1741 CATCTACTGGACTGACTGGCAGCGCCGCAGCATCGAGCGGGTGCACAAGGTCAAGGCCAG 1800 

558 IYWTDWQRRSIERVHKVKAS 577 

18 01 CCGGGACGTCATCATTGACCAGCTGCCCGACCTGATGGGGCTCAAAGCTGTGAATGTGGG 1860 

578 RDVIIDQLPDLMG LKAVNVA 597 

18 61 CAAGGTCGTCGGAACCAACCCGTGTGCGGACAGGAACGGGGGGTGCAGCCACCTGTGCTT 1920 

598 KVVG TNPCADRNGGCSHLCF 617 

1921 CTTCACACCCCACGCAACCCGGTGTGGCTGCCCCATCGGCCTGGAGCTGCTGAGTGACAT 1980 

618 FTPHATRCGCPIGLELLS DM 637 

1981 GAAGACCTGCATCGTGCCTGAGGCCTTCTTGGTCTTCACCAGCAGAGCCGCCATCCACAG 2 04 0 

638 KTCIVPEAFLVFTSRAAIHR 657 

2 041 GATCTCCCTCGAGACCAATAACAACGACGTGGCCATCCCGCTCACGGGCGTCAAGGAGGC 2100 

658 ISLETNNNDVAI PLTGVKEA 677 

2101 CTCAGCCCTGGACTTTGATGTGTCCAACAACCACATCTACTGGACAGACGTCAGCCTGAA 2160 

678 SALDF DVSNNHIYWTDVSLK 697 

2161 GACCATCAGCCGCGCCTTCATGAACGGGAGCTCGGTGGAGCACGTGGTGGAGTTTGGCCT 222 0 

698 TISRAFNNGSSVEHVVEFGL 717 



Figure 6B 



2221 TGACTACCCCGAGGGCATGGCCGTTGACTGGATGGGCAAGAACCTCTACTGGGCCGACAC 2 2 80 

718 DYPEGMAVDWMGKNLYWADT 737 

2281 TGGGACCAACAGAATCGAAGTGGCGCGGCTGGACGGGCAGTTCCGGCAAGTCCTCGTGTG 2 3 4 0 

738 GTNRIEVARLDGQ FRQVLVW 757 

2 341 GAGGGACTTGGACAACCCGAGGTCGCTGGCCCTGGATCCCACCAAGGGCTACATCTACTG 24 0 0 

758 RDLDNPRSLALDPTKGYIYW 777 

24 01 GACCGAGTGGGGCGGCAAGCCGAGGATCGTGCGGGCCTTCATGGACGGGACCAACTGCAT 24 60 

778 TEWGGKPRIVRAFMDGTNCM 797 

2461 GACGCTGGTGGACAAGGTGGGCCGGGCCAACGACCTC ACCATTGACTACGCTGACCAGCG 2 52 0 

798 TLVDKVGRANDLTIDYADQ R 817 



2521 CCTCTACTGGACCGACCTGGACACCAACATGATCGAGTCGTCCAACATGCTGGGTCAGGA 2580 

818 LYWTDLDTNMIESSNMLGQE 837 

2581 GCGGGTCGTGATTGCCGACGATCTCCCGCACCCGTTCGGTCTGACGCAGTACAGCGATTA 264 0 

838 RVVIADDLPHPFGLTQYSDY 857 



2641 TATCTACTGGACAGACTGGAATCTGCACAGCATTGAGCGGGCCGACAAGACTAGCGGCCG 2700 

858 IYWTDWNLHSIERADKTSGR 877 

2 7 01 GAACCGCACCCTCATC CAGGGCCAC CTGGACTTCGTGATGGACATC CTGGTGTT CCACTC 27 60 

878 NRTLIQGHLDFVMDILVFHS 897 

27 61 CTCCCGCC AGGATGGCCTCAATGACTGTATGCACAAC AACGGGCAGTGTGGGC AGCTGTG 2 820 

898 SRQDGLNDCMHNNGQCGQLC 917 

2 821 CCTTGCCATCCCCGGCGGCCACCGCTGCGGCTGCGCCTCACACTACACCCTGGACCCCAG 2 880 

918 LAIPGGHRCGCASHYTLDPS 937 

2 8 81 CAGCCGCAACTGCAGCCCGCCCACCACCTTCTTGCTGTTCAGCCAGAAATCTGCCATCAG 2 940 
938 SRNCSPPTTFLLFSQKS AIS 957 

2941 TCGGATGATCCCGGACGACCAGCACAGCCCGGATCTCATCCTGCCCCTGCATGGACTGAG 3 0 00 

958 RMIPDDQHSPDLILPIiHGLR 977 

3 0 01 GAACGTCAAAGCCATCGACTATGACCCACTGGACAAGTTCATCTACTGGGTGGATGGGCG 3 060 
978 NVKAIDYDPLDKFIYWVDGR 997 

3 061 CCAGAACATCAAGCGAGCCAAGGACGACGGGACCCAGCCCTTTGTTTTGACCTCTCTGAG 312 0 

998 QNIKRAKDDGTQPFVLTSLS 1017 



3121 CCAAGGCCAAAACCCAGACAGGCAGCCCCACGACCTCAGCATCGACATCTACAGCCGGAC 3180 

1018 QGQNPDRQPHDLSIDIYSRT 1037 

3181 ACTGTTCTGGACGTGCGAGGCCACCAATACCATCAACGTCCACAGGCTGAGCGGGGAAGC 3240 

1038 LFWTCEATNTINVHRLSGEA 1057 

3241 CATGGGGGTGGTGCTGCGTGGGGAGCGCGACAAGCCCAGGGCCATGGTCGTCAACGCGGA 3 3 0 0 

1058 MGVVLRGDRDKPRAI VVNAE 1077 



Figure 6C 



33 01 GCGAGGGTACCTGTACTTCAC CAACATGCAGGAC CGGGCAGCCAAGATCGAACGCGCAGC 3 3 60 

1078 RGYLYFTNMQDRAAKIERAA 1097 

3 3 61 CCTGGACGGCACCGAGCGCGAGGTCCTCTTCACCACCGGCCTCATCCGCCCTGTGGCCCT 342 0 

1098 LDGTEREVLFTTGLIRPVAL 1117 

3421 GGTGGTGGACAACACACTGGGCAAGCTGTTCTGGGTGGACGCGGACCTGAAGCGCATTGA 34 8 0 

1118 VVDNTLGKLFWVDADLKRIE 1137 

3481 GAGCTGTGACCTGTCAGGGGCCAACCGCCTGACCCTGGAGGACGCCAACATCGTGCAGCC 3540 

1138 SCDLSGANRLTLEDANIVQP 1157 

3 541 TCTGGGCCTGACCATCCTTGGCAAGCATCTCTACTGGATCGACCGCCAGCAGCAGATGAT 3 600 

1158 LGL TIIiGKHLYWIDRQQQMI 1177 

3 601 CGAGCGTGTGGAGAAGACCACCGGGGACAAGCGGACTCGCATCCAGGGCCGTGTCGCCCA 3 660 

1178 ERVEKTTGDKRTRI QGRVAH 1197 

3 661 CCTCACTGGCATC CATGCAGTGGAGGAAGTCAGCCTGGAGGAGTTCTG AGCC C ACCCATG 3720 

1198 LTGIHAVEEVSLEEFSAHPC 1217 

3721 TGCCCGTGACAATGGTGGCTGCTCCCACATCTGTATTGCCAAGGGTGATGGGACACCACG 37 80 

1218 ARD NGGCSHICIAKGDGTPR 1237 

3 7 81 GTGCTCATGCCCAGTCCACCTCGTGCTCCTGCAGAACCTGCTGACCTGTGGAGAGCCGCC 3 84 0 

1238 CSCPVHLVLLQNLL TCGEPP 1257 

3 841 CACCTGCTCCCCGGACCAGTTTGCATGTGGCACAGGGGAGATCGACTGTATGCCCGGGGC 3 900 

1258 T C S P D Q F AC AT G E I D C I P G A 1277 

3 901 CTGGCGCTGTGACGGCTTTCCCGAGTGCGATGACCAGAGCGACGAGGAGGGCTGCCCCGT 3 960 
1278 WRCDGFPECDDQSDEEGCPV 1297 

3961 GTGCTCCGCCGCCCAGTTCCCCTGCGCGCGGGGTCAGTGTGTGGACCTGCGCCTGCGCTG 402 0 

1298 CSAAQFPCARGQCVDLRLRC 1317 

4021 CGACGGCGAGGCAGACTGTCAGGACCGCTCAGACGAGGTGGACTGTGACGCCATCTGCCT 4 080 

1318 DGE ADCQDRSDEVDCDAICL 1337 

4 081 GCCCAACCAGTTCCGGTGTGCGAGCGGCCAGTGTGTCCTCATCAAACAGCAGTGCGACTC 414 0 
1338 PNQFRCASGQCVLIKQQCDS 1357 

4141 CTTCCCCGACTGTATCGACGGCTCCGACGAGCTCATGTGTGAAATCACCAAGCCGCCCTC 42 00 

1358 FPDCID GSDELMCEITK PPS 1377 

42 01 AGACGACAGCCCGGCCCACAGCAGTGCCATCGGGCCCGTCATTGGCATCATCCTCTCTCT 4260 

1378 DDSPAHSSAIGPVIGIILSL 1397 

4261 CTTCGTCATGGGTGGTGTCTATTTTGTGTGCCAGCGCGTGGTGTGCCAGCGCTATGCGGG 4 32 0 

1398 FVMGGVYFVCQRVVCQRYAG 1417 

4321 GGCCAACGGGCCCTTCCCGCACGAGTATGTCAGCGGGACCCCGCACGTGCCCCTCAATTT 4 3 8 0 

1418 ANGPFPHEYVS GTPH VPLNF 1437 



Figure 6D 



4 381 CATAGCCCCGGGCGGTTCCCAGCATGGCCCCTTCACAGGCATCGCATGCGGAAAGTCCAT 44 4 0 

1438 IAPGGSQHGPFTGIACGKSM 1457 

4441 GATGAGCTCCGTGAGCCTGATGGGGGGCCGGGGCGGGGTGCCCCTCTACGACCGGAACCA 45 00 

1458 MSSVSLMGGRGGVPLYDRNH 1477 

4501 CGTCACAGGGGCCTCGTCCAGCAGCTCGTCCAGCACGAAGGCCACGCTGTACCCGCCGAT 45 6 0 

1478 VTGASSSSSSSTKATLYPPI 1497 

4561 CCTGAACCCGCCGCCCTCCCCGGCCACGGACCCGTCCCTGTACAACATGGACATGTTCTA 4620 

1498 LNPPPSPATDPSLYNMDMFY 1517 



4 621 CTCTTCAAACATTCCGGCCACTGCGAGACCGTACAGGCCCTACATCATTCGAGGAATGGC 468 0 

1518 SSNIPATARPYRPYIIRGMA 1537 

4681 GCCCCCGACGACGCCCTGCAGCACCGACGTGTGTGACAGCGACTACAGCGCCAGCCGCTG 4740 

1538 ppTTPCSTD VCDSDYSASRW 1557 

4741 GAAGGC CAGCAAGTACTACCTGGATTTGAAGTCGGACTCAGACC CCTAT CCAC C CC CACC 4800 

1558 KASKYYLDL NSDSDPYPPPP 1577 

4801 CACGCCCCACAGCCAGTACCTGTCGGCGGAGGACAGCTGCCCGCCCTCGCCCGCCACCGA 4 8 60 

1578 T PHSQYLSAE DSCPPSPATE 1597 

4 8 61 GAGGAGCTACTTCCATCTCTTCCCGCCCCCTCCGTCCCCCTGCACGGACTCATCCTGACC 4 920 

1598 RSYFHL FPP PPSPCTDSS 1615 

4921 TCGGCCGGGCCACTCTGGCTTCTCTGTGCCCCTGTAAATAGTTTTAAATATGAACAAAGA 4 9 80 

4981 AAAAAATATATTTTATGATTTAAAAAATAAATATAATTGGGATTTTAAAAACATGAGAAA 504 0 

5041 TGTGAACTGTGATGGGGTGGGCAGGGCTGGGAGAACTTTGTACAGTGGAGAAATATTTAT 5100 

5101 AAACTTAATT TTGTAAAAC A 512 0 



Figure 6E 

SEQUENCE LISTING 

<110> John P. Carulli et al. 
<120> THE HIGH BONE MASS GENE OF 11q13.3 
<130> 032796-014 
<150> US 09/229,319 
<151> 1999-01-13 
<150> US 60/071,449 
<151> 1998-01-13 
<150> US 60/105,511 
<151> 1998-10-23 

<160> 62 

<210> 1 
<211> 5120 
<212> DNA 
<213> Homo sapiens 

<400> 1 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcc 60 
gccggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg 109 
Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu 
1 5 10 

ctg ctg ctg ctg ctg ctg ctg gcg ctg tgc ggc tgc ccg gcc ccc gcc 157 
Leu Leu Leu Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala 
15 20 25 

gcg gcc tcg ccg ctc ctg cta ttt gcc aac cgc cgg gac gta cgg ctg 205 
Ala Ala Ser Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu 
30 35 40 45 

gtg gac gcc ggc gga gtc aag ctg gag tcc acc atc gtg gtc agc ggc 253 
Val Asp Ala Gly Gly Val Lys Leu Glu Ser Thr Ile Val Val Ser Gly 
50 55 60 

ctg gag gat gcg gcc gca gtg gac ttc cag ttt tcc aag gga gcc gtg 301 
Leu Glu Asp Ala Ala Ala Val Asp Phe Gln Phe Ser Lys Gly Ala Val 
65 70 75 

tac tgg aca gac gtg agc gag gag gcc atc aag cag acc tac ctg aac 349 
Tyr Trp Thr Asp Val Ser Glu Glu Ala Ile Lys Gln Thr Tyr Leu Asn 
80 85 90 

cag acg ggg gcc gcc gtg cag aac gtg gtc atc tcc ggc ctg gtc tct 397 
Gln Thr Gly Ala Ala Val Gln Asn Val Val Ile Ser Gly Leu Val Ser 
95 100 105 

ccc gac ggc ctc gcc tgc gac tgg gtg ggc aag aag ctg tac tgg acg 445 
Pro Asp Gly Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr 
110 115 120 125 

gac tca gag acc aac cgc atc gag gtg gcc aac ctc aat ggc aca tcc 493 
Asp Ser Glu Thr Asn Arg Ile Glu Val Ala Asn Leu Asn Gly Thr Ser 
130 135 140 

cgg aag gtg ctc ttc tgg cag gac ctt gac cag ccg agg gcc atc gcc 541 
Arg Lys Val Leu Phe Trp Gln Asp Leu Asp Gln Pro Arg Ala Ile Ala 
145 150 155 

ttg gac ccc gct cac ggg tac atg tac tgg aca gac tgg ggt gag acg 589 
Leu Asp Pro Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Gly Glu Thr 
160 165 170 

ccc cgg att gag cgg gca ggg atg gat ggc agc acc cgg aag atc att 637 
Pro Arg Ile Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys Ile Ile 
175 180 185 

gtg gac tcg gac att tac tgg ccc aat gga ctg acc atc gac ctg gag 685 
Val Asp Ser Asp Ile Tyr Trp Pro Asn Gly Leu Thr Ile Asp Leu Glu 



190 195 200 205

gag cag aag ctc tac tgg gct gac gcc aag ctc agc ttc atc cac cgt
Glu Gln Lys Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe Ile His Arg
210 215 220

gcc aac ctg gac ggc tcg ttc cgg cag aag gtg gtg gag ggc agc ctg
Ala Asn Leu Asp Gly Ser Phe Arg Gln Lys Val Val Glu Gly Ser Leu
225 230 235

acg cac ccc ttc gcc ctg acg ctc tcc ggg gac act ctg tac tgg aca
Thr His Pro Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr
240 245 250

gac tgg cag acc cgc tcc atc cat gcc tgc aac aag cgc act ggg ggg
Asp Trp Gln Thr Arg Ser Ile His Ala Cys Asn Lys Arg Thr Gly Gly
255 260 265

aag agg aag gag atc ctg agt gcc ctc tac tca ccc atg gac atc cag
Lys Arg Lys Glu Ile Leu Ser Ala Leu Tyr Ser Pro Met Asp Ile Gln
270 275 280 285

gtg ctg agc cag gag cgg cag cct ttc ttc cac act cgc tgt gag gag
Val Leu Ser Gln Glu Arg Gln Pro Phe Phe His Thr Arg Cys Glu Glu
290 295 300

gac aat ggc ggc tgc tcc cac ctg tgc ctg ctg tcc cca agc gag cct
Asp Asn Gly Gly Cys Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro
305 310 315

ttc tac aca tgc gcc tgc ccc acg ggt gtg cag ctg cag gac aac ggc
Phe Tyr Thr Cys Ala Cys Pro Thr Gly Val Gln Leu Gln Asp Asn Gly
320 325 330

agg acg tgt aag gca gga gcc gag gag gtg ctg ctg ctg gcc cgg cgg
Arg Thr Cys Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg
335 340 345

acg gac cta cgg agg atc tcg ctg gac acg ccg gac ttc acc gac atc
Thr Asp Leu Arg Arg Ile Ser Leu Asp Thr Pro Asp Phe Thr Asp Ile
350 355 360 365

gtg ctg cag gtg gac gac atc cgg cac gcc att gcc atc gac tac gac
Val Leu Gln Val Asp Asp Ile Arg His Ala Ile Ala Ile Asp Tyr Asp
370 375 380

ccg cta gag ggc tat gtc tac tgg aca gat gac gag gtg cgg gcc atc
Pro Leu Glu Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala Ile
385 390 395

cgc agg gcg tac ctg gac ggg tct ggg gcg cag acg ctg gtc aac acc
Arg Arg Ala Tyr Leu Asp Gly Ser Gly Ala Gln Thr Leu Val Asn Thr
400 405 410

gag atc aac gac ccc gat ggc atc gcg gtc gac tgg gtg gcc cga aac
Glu Ile Asn Asp Pro Asp Gly Ile Ala Val Asp Trp Val Ala Arg Asn
415 420 425

ctc tac tgg acc gac acg ggc acg gac cgc atc gag gtg acg cgc ctc
Leu Tyr Trp Thr Asp Thr Gly Thr Asp Arg Ile Glu Val Thr Arg Leu
430 435 440 445

aac ggc acc tcc cgc aag atc ctg gtg tcg gag gac ctg gac gag ccc
Asn Gly Thr Ser Arg Lys Ile Leu Val Ser Glu Asp Leu Asp Glu Pro
450 455 460

cga gcc atc gca ctg cac ccc gtg atg ggc ctc atg tac tgg aca gac
Arg Ala Ile Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp
465 470 475

tgg gga gag aac cct aaa atc gag tgt gcc aac ttg gat ggg cag gag
Trp Gly Glu Asn Pro Lys Ile Glu Cys Ala Asn Leu Asp Gly Gln Glu
480 485 490

cgg cgt gtg ctg gtc aat gcc tcc ctc ggg tgg ccc aac ggc ctg gcc
Arg Arg Val Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala
495 500 505

ctg gac ctg cag gag ggg aag ctc tac tgg gga gac gcc aag aca gac
Leu Asp Leu Gln Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp
510 515 520 525

aag atc gag gtg atc aat gtt gat ggg acg aag agg cgg acc ctc ctg
Lys Ile Glu Val Ile Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu
530 535 540

gag gac aag ctc ccg cac att ttc ggg ttc acg ctg ctg ggg gac ttc
Glu Asp Lys Leu Pro His Ile Phe Gly Phe Thr Leu Leu Gly Asp Phe
545 550 555

atc tac tgg act gac tgg cag cgc cgc agc atc gag cgg gtg cac aag
Ile Tyr Trp Thr Asp Trp Gln Arg Arg Ser Ile Glu Arg Val His Lys
560 565 570

gtc aag gcc agc cgg gac gtc atc att gac cag ctg ccc gac ctg atg
Val Lys Ala Ser Arg Asp Val Ile Ile Asp Gln Leu Pro Asp Leu Met
575 580 585

ggg ctc aaa gct gtg aat gtg gcc aag gtc gtc gga acc aac ccg tgt
Gly Leu Lys Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys
590 595 600 605

gcg gac agg aac ggg ggg tgc agc cac ctg tgc ttc ttc aca ccc cac
Ala Asp Arg Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His
610 615 620

gca acc cgg tgt ggc tgc ccc atc ggc ctg gag ctg ctg agt gac atg
Ala Thr Arg Cys Gly Cys Pro Ile Gly Leu Glu Leu Leu Ser Asp Met
625 630 635

aag acc tgc atc gtg cct gag gcc ttc ttg gtc ttc acc agc aga gcc
Lys Thr Cys Ile Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala
640 645 650

gcc atc cac agg atc tcc ctc gag acc aat aac aac gac gtg gcc atc
Ala Ile His Arg Ile Ser Leu Glu Thr Asn Asn Asn Asp Val Ala Ile
655 660 665

ccg ctc acg ggc gtc aag gag gcc tca gcc ctg gac ttt gat gtg tcc 2125
Pro Leu Thr Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser
670 675 680 685

aac aac cac atc tac tgg aca gac gtc agc ctg aag acc atc agc cgc 2173
Asn Asn His Ile Tyr Trp Thr Asp Val Ser Leu Lys Thr Ile Ser Arg
690 695 700

gcc ttc atg aac ggg agc tcg gtg gag cac gtg gtg gag ttt ggc ctt 2221
Ala Phe Met Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu
705 710 715

gac tac ccc gag ggc atg gcc gtt gac tgg atg ggc aag aac ctc tac 2269
Asp Tyr Pro Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr
720 725 730

tgg gcc gac act ggg acc aac aga atc gaa gtg gcg cgg ctg gac ggg 2317
Trp Ala Asp Thr Gly Thr Asn Arg Ile Glu Val Ala Arg Leu Asp Gly
735 740 745

cag ttc cgg caa gtc ctc gtg tgg agg gac ttg gac aac ccg agg tcg 2365
Gln Phe Arg Gln Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser
750 755 760 765

ctg gcc ctg gat ccc acc aag ggc tac atc tac tgg acc gag tgg ggc 2413
Leu Ala Leu Asp Pro Thr Lys Gly Tyr Ile Tyr Trp Thr Glu Trp Gly
770 775 780

ggc aag ccg agg atc gtg cgg gcc ttc atg gac ggg acc aac tgc atg 2461
Gly Lys Pro Arg Ile Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met
785 790 795

acg ctg gtg gac aag gtg ggc cgg gcc aac gac ctc acc att gac tac 2509
Thr Leu Val Asp Lys Val Gly Arg Ala Asn Asp Leu Thr Ile Asp Tyr
800 805 810

gct gac cag cgc ctc tac tgg acc gac ctg gac acc aac atg atc gag 2557
Ala Asp Gln Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met Ile Glu
815 820 825

tcg tcc aac atg ctg ggt cag gag cgg gtc gtg att gcc gac gat ctc
Ser Ser Asn Met Leu Gly Gln Glu Arg Val Val Ile Ala Asp Asp Leu
830 835 840 845

ccg cac ccg ttc ggt ctg acg cag tac agc gat tat atc tac tgg aca
Pro His Pro Phe Gly Leu Thr Gln Tyr Ser Asp Tyr Ile Tyr Trp Thr
850 855 860

gac tgg aat ctg cac agc att gag cgg gcc gac aag act agc ggc cgg
Asp Trp Asn Leu His Ser Ile Glu Arg Ala Asp Lys Thr Ser Gly Arg
865 870 875

aac cgc acc ctc atc cag ggc cac ctg gac ttc gtg atg gac atc ctg
Asn Arg Thr Leu Ile Gln Gly His Leu Asp Phe Val Met Asp Ile Leu
880 885 890

gtg ttc cac tcc tcc cgc cag gat ggc ctc aat gac tgt atg cac aac
Val Phe His Ser Ser Arg Gln Asp Gly Leu Asn Asp Cys Met His Asn
895 900 905

aac ggg cag tgt ggg cag ctg tgc ctt gcc atc ccc ggc ggc cac cgc
Asn Gly Gln Cys Gly Gln Leu Cys Leu Ala Ile Pro Gly Gly His Arg
910 915 920 925

tgc ggc tgc gcc tca cac tac acc ctg gac ccc agc agc cgc aac tgc
Cys Gly Cys Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys
930 935 940

agc ccg ccc acc acc ttc ttg ctg ttc agc cag aaa tct gcc atc agt
Ser Pro Pro Thr Thr Phe Leu Leu Phe Ser Gln Lys Ser Ala Ile Ser
945 950 955

cgg atg atc ccg gac gac cag cac agc ccg gat ctc atc ctg ccc ctg
Arg Met Ile Pro Asp Asp Gln His Ser Pro Asp Leu Ile Leu Pro Leu
960 965 970

cat gga ctg agg aac gtc aaa gcc atc gac tat gac cca ctg gac aag 3037
His Gly Leu Arg Asn Val Lys Ala Ile Asp Tyr Asp Pro Leu Asp Lys
975 980 985

ttc atc tac tgg gtg gat ggg cgc cag aac atc aag cga gcc aag gac 3085
Phe Ile Tyr Trp Val Asp Gly Arg Gln Asn Ile Lys Arg Ala Lys Asp
990 995 1000 1005

gac ggg acc cag ccc ttt gtt ttg acc tct ctg agc caa ggc caa aac 3133
Asp Gly Thr Gln Pro Phe Val Leu Thr Ser Leu Ser Gln Gly Gln Asn
1010 1015 1020

cca gac agg cag ccc cac gac ctc agc atc gac atc tac agc cgg aca 3181
Pro Asp Arg Gln Pro His Asp Leu Ser Ile Asp Ile Tyr Ser Arg Thr
1025 1030 1035

ctg ttc tgg acg tgc gag gcc acc aat acc atc aac gtc cac agg ctg 3229
Leu Phe Trp Thr Cys Glu Ala Thr Asn Thr Ile Asn Val His Arg Leu
1040 1045 1050

agc ggg gaa gcc atg ggg gtg gtg ctg cgt ggg gac cgc gac aag ccc 3277
Ser Gly Glu Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro
1055 1060 1065

agg gcc atc gtc gtc aac gcg gag cga ggg tac ctg tac ttc acc aac 3325
Arg Ala Ile Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn
1070 1075 1080 1085

atg cag gac cgg gca gcc aag atc gaa cgc gca gcc ctg gac ggc acc 3373
Met Gln Asp Arg Ala Ala Lys Ile Glu Arg Ala Ala Leu Asp Gly Thr
1090 1095 1100

gag cgc gag gtc ctc ttc acc acc ggc ctc atc cgc cct gtg gcc ctg 3421
Glu Arg Glu Val Leu Phe Thr Thr Gly Leu Ile Arg Pro Val Ala Leu
1105 1110 1115

gtg gtg gac aac aca ctg ggc aag ctg ttc tgg gtg gac gcg gac ctg 3469
Val Val Asp Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu
1120 1125 1130

aag cgc att gag agc tgt gac ctg tca ggg gcc aac cgc ctg acc ctg 3517
Lys Arg Ile Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu
1135 1140 1145

gag gac gcc aac atc gtg cag cct ctg ggc ctg acc atc ctt ggc aag 3565
Glu Asp Ala Asn Ile Val Gln Pro Leu Gly Leu Thr Ile Leu Gly Lys
1150 1155 1160 1165

cat ctc tac tgg atc gac cgc cag cag cag atg atc gag cgt gtg gag 3613
His Leu Tyr Trp Ile Asp Arg Gln Gln Gln Met Ile Glu Arg Val Glu
1170 1175 1180

aag acc acc ggg gac aag cgg act cgc atc cag ggc cgt gtc gcc cac 3661
Lys Thr Thr Gly Asp Lys Arg Thr Arg Ile Gln Gly Arg Val Ala His
1185 1190 1195

ctc act ggc atc cat gca gtg gag gaa gtc agc ctg gag gag ttc tca 3709
Leu Thr Gly Ile His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser
1200 1205 1210

gcc cac cca tgt gcc cgt gac aat ggt ggc tgc tcc cac atc tgt att 3757
Ala His Pro Cys Ala Arg Asp Asn Gly Gly Cys Ser His Ile Cys Ile
1215 1220 1225

gcc aag ggt gat ggg aca cca cgg tgc tca tgc cca gtc cac ctc gtg 3805
Ala Lys Gly Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val
1230 1235 1240 1245

ctc ctg cag aac ctg ctg acc tgt gga gag ccg ccc acc tgc tcc ccg 3853
Leu Leu Gln Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro
1250 1255 1260

gac cag ttt gca tgt gcc aca ggg gag atc gac tgt atc ccc ggg gcc 3901
Asp Gln Phe Ala Cys Ala Thr Gly Glu Ile Asp Cys Ile Pro Gly Ala
1265 1270 1275

tgg cgc tgt gac ggc ttt ccc gag tgc gat gac cag agc gac gag gag 3949
Trp Arg Cys Asp Gly Phe Pro Glu Cys Asp Asp Gln Ser Asp Glu Glu
1280 1285 1290

ggc tgc ccc gtg tgc tcc gcc gcc cag ttc ccc tgc gcg cgg ggt cag 3997
Gly Cys Pro Val Cys Ser Ala Ala Gln Phe Pro Cys Ala Arg Gly Gln
1295 1300 1305

tgt gtg gac ctg cgc ctg cgc tgc gac ggc gag gca gac tgt cag gac 4045
Cys Val Asp Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gln Asp
1310 1315 1320 1325

cgc tca gac gag gtg gac tgt gac gcc atc tgc ctg ccc aac cag ttc 4093
Arg Ser Asp Glu Val Asp Cys Asp Ala Ile Cys Leu Pro Asn Gln Phe
1330 1335 1340

cgg tgt gcg agc ggc cag tgt gtc ctc atc aaa cag cag tgc gac tcc 4141
Arg Cys Ala Ser Gly Gln Cys Val Leu Ile Lys Gln Gln Cys Asp Ser
1345 1350 1355

ttc ccc gac tgt atc gac ggc tcc gac gag ctc atg tgt gaa atc acc 4189
Phe Pro Asp Cys Ile Asp Gly Ser Asp Glu Leu Met Cys Glu Ile Thr
1360 1365 1370

aag ccg ccc tca gac gac agc ccg gcc cac agc agt gcc atc ggg ccc 4237
Lys Pro Pro Ser Asp Asp Ser Pro Ala His Ser Ser Ala Ile Gly Pro
1375 1380 1385

gtc att ggc atc atc ctc tct ctc ttc gtc atg ggt ggt gtc tat ttt 4285
Val Ile Gly Ile Ile Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe
1390 1395 1400 1405

gtg tgc cag cgc gtg gtg tgc cag cgc tat gcg ggg gcc aac ggg ccc 4333
Val Cys Gln Arg Val Val Cys Gln Arg Tyr Ala Gly Ala Asn Gly Pro
1410 1415 1420

ttc ccg cac gag tat gtc agc ggg acc ccg cac gtg ccc ctc aat ttc 4381
Phe Pro His Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe
1425 1430 1435

ata gcc ccg ggc ggt tcc cag cat ggc ccc ttc aca ggc atc gca tgc 4429
Ile Ala Pro Gly Gly Ser Gln His Gly Pro Phe Thr Gly Ile Ala Cys
1440 1445 1450

gga aag tcc atg atg agc tcc gtg agc ctg atg ggg ggc cgg ggc ggg 4477
Gly Lys Ser Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly
1455 1460 1465

gtg ccc ctc tac gac cgg aac cac gtc aca ggg gcc tcg tcc agc agc 4525
Val Pro Leu Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser
1470 1475 1480 1485

tcg tcc agc acg aag gcc acg ctg tac ccg ccg atc ctg aac ccg ccg 4573
Ser Ser Ser Thr Lys Ala Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro
1490 1495 1500

ccc tcc ccg gcc acg gac ccc tcc ctg tac aac atg gac atg ttc tac 4621
Pro Ser Pro Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr
1505 1510 1515

tct tca aac att ccg gcc act gcg aga ccg tac agg ccc tac atc att 4669
Ser Ser Asn Ile Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr Ile Ile
1520 1525 1530

cga gga atg gcg ccc ccg acg acg ccc tgc agc acc gac gtg tgt gac 4717
Arg Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp
1535 1540 1545

agc gac tac agc gcc agc cgc tgg aag gcc agc aag tac tac ctg gat 4765
Ser Asp Tyr Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp
1550 1555 1560 1565

ttg aac tcg gac tca gac ccc tat cca ccc cca ccc acg ccc cac agc 4813
Leu Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser
1570 1575 1580

cag tac ctg tcg gcg gag gac agc tgc ccg ccc tcg ccc gcc acc gag 4861
Gln Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu
1585 1590 1595

agg agc tac ttc cat ctc ttc ccg ccc cct ccg tcc ccc tgc acg gac 4909
Arg Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp
1600 1605 1610

tca tcc tgacctcggc cgggccactc tggcttctct gtgcccctgt aaatagtttt 4965
Ser Ser
1615

aaatatgaac aaagaaaaaa atatatttta tgatttaaaa aataaatata attgggattt 5025
taaaaacatg agaaatgtga actgtgatgg ggtgggcagg gctgggagaa ctttgtacag 5085
tggagaaata tttataaact taattttgta aaaca 5120



<210> 2 

<211> 5120 

<212> DNA 

<213> Homo sapiens 

<400> 2 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagcg cggcgcgggc ccgtccggcc 60
gccggacaac atg gag gca gcg ccg ccc ggg ccg ccg tgg ccg ctg ctg 109
Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu
1 5 10

ctg ctg ctg ctg ctg ctg ctg gcg ctg tgc ggc tgc ccg gcc ccc gcc 157
Leu Leu Leu Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala
15 20 25

gcg gcc tcg ccg ctc ctg cta ttt gcc aac cgc cgg gac gta cgg ctg 205
Ala Ala Ser Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu
30 35 40 45

gtg gac gcc ggc gga gtc aag ctg gag tcc acc atc gtg gtc agc ggc 253
Val Asp Ala Gly Gly Val Lys Leu Glu Ser Thr Ile Val Val Ser Gly



50 55 60

ctg gag gat gcg gcc gca gtg gac ttc cag ttt tcc aag gga gcc gtg
Leu Glu Asp Ala Ala Ala Val Asp Phe Gln Phe Ser Lys Gly Ala Val
65 70 75

tac tgg aca gac gtg agc gag gag gcc atc aag cag acc tac ctg aac
Tyr Trp Thr Asp Val Ser Glu Glu Ala Ile Lys Gln Thr Tyr Leu Asn
80 85 90

cag acg ggg gcc gcc gtg cag aac gtg gtc atc tcc ggc ctg gtc tct
Gln Thr Gly Ala Ala Val Gln Asn Val Val Ile Ser Gly Leu Val Ser
95 100 105

ccc gac ggc ctc gcc tgc gac tgg gtg ggc aag aag ctg tac tgg acg
Pro Asp Gly Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr
110 115 120 125

gac tca gag acc aac cgc atc gag gtg gcc aac ctc aat ggc aca tcc
Asp Ser Glu Thr Asn Arg Ile Glu Val Ala Asn Leu Asn Gly Thr Ser
130 135 140

cgg aag gtg ctc ttc tgg cag gac ctt gac cag ccg agg gcc atc gcc
Arg Lys Val Leu Phe Trp Gln Asp Leu Asp Gln Pro Arg Ala Ile Ala
145 150 155

ttg gac ccc gct cac ggg tac atg tac tgg aca gac tgg gtt gag acg
Leu Asp Pro Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Val Glu Thr
160 165 170

ccc cgg att gag cgg gca ggg atg gat ggc agc acc cgg aag atc att
Pro Arg Ile Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys Ile Ile
175 180 185

gtg gac tcg gac att tac tgg ccc aat gga ctg acc atc gac ctg gag
Val Asp Ser Asp Ile Tyr Trp Pro Asn Gly Leu Thr Ile Asp Leu Glu
190 195 200 205

gag cag aag ctc tac tgg gct gac gcc aag ctc agc ttc atc cac cgt
Glu Gln Lys Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe Ile His Arg
210 215 220

gcc aac ctg gac ggc tcg ttc cgg cag aag gtg gtg gag ggc agc ctg
Ala Asn Leu Asp Gly Ser Phe Arg Gln Lys Val Val Glu Gly Ser Leu
225 230 235

acg cac ccc ttc gcc ctg acg ctc tcc ggg gac act ctg tac tgg aca
Thr His Pro Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr
240 245 250

gac tgg cag acc cgc tcc atc cat gcc tgc aac aag cgc act ggg ggg
Asp Trp Gln Thr Arg Ser Ile His Ala Cys Asn Lys Arg Thr Gly Gly
255 260 265

aag agg aag gag atc ctg agt gcc ctc tac tca ccc atg gac atc cag
Lys Arg Lys Glu Ile Leu Ser Ala Leu Tyr Ser Pro Met Asp Ile Gln
270 275 280 285

gtg ctg agc cag gag cgg cag cct ttc ttc cac act cgc tgt gag gag
Val Leu Ser Gln Glu Arg Gln Pro Phe Phe His Thr Arg Cys Glu Glu
290 295 300

gac aat ggc ggc tgc tcc cac ctg tgc ctg ctg tcc cca agc gag cct
Asp Asn Gly Gly Cys Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro
305 310 315

ttc tac aca tgc gcc tgc ccc acg ggt gtg cag ctg cag gac aac ggc
Phe Tyr Thr Cys Ala Cys Pro Thr Gly Val Gln Leu Gln Asp Asn Gly
320 325 330

agg acg tgt aag gca gga gcc gag gag gtg ctg ctg ctg gcc cgg cgg
Arg Thr Cys Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg
335 340 345

acg gac cta cgg agg atc tcg ctg gac acg ccg gac ttc acc gac atc
Thr Asp Leu Arg Arg Ile Ser Leu Asp Thr Pro Asp Phe Thr Asp Ile
350 355 360 365

gtg ctg cag gtg gac gac atc cgg cac gcc att gcc atc gac tac gac
Val Leu Gln Val Asp Asp Ile Arg His Ala Ile Ala Ile Asp Tyr Asp
370 375 380

ccg cta gag ggc tat gtc tac tgg aca gat gac gag gtg cgg gcc atc
Pro Leu Glu Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala Ile
385 390 395

cgc agg gcg tac ctg gac ggg tct ggg gcg cag acg ctg gtc aac acc
Arg Arg Ala Tyr Leu Asp Gly Ser Gly Ala Gln Thr Leu Val Asn Thr
400 405 410

gag atc aac gac ccc gat ggc atc gcg gtc gac tgg gtg gcc cga aac
Glu Ile Asn Asp Pro Asp Gly Ile Ala Val Asp Trp Val Ala Arg Asn
415 420 425

ctc tac tgg acc gac acg ggc acg gac cgc atc gag gtg acg cgc ctc
Leu Tyr Trp Thr Asp Thr Gly Thr Asp Arg Ile Glu Val Thr Arg Leu
430 435 440 445

aac ggc acc tcc cgc aag atc ctg gtg tcg gag gac ctg gac gag ccc
Asn Gly Thr Ser Arg Lys Ile Leu Val Ser Glu Asp Leu Asp Glu Pro
450 455 460

cga gcc atc gca ctg cac ccc gtg atg ggc ctc atg tac tgg aca gac
Arg Ala Ile Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp
465 470 475

tgg gga gag aac cct aaa atc gag tgt gcc aac ttg gat ggg cag gag
Trp Gly Glu Asn Pro Lys Ile Glu Cys Ala Asn Leu Asp Gly Gln Glu
480 485 490

cgg cgt gtg ctg gtc aat gcc tcc ctc ggg tgg ccc aac ggc ctg gcc
Arg Arg Val Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala
495 500 505

ctg gac ctg cag gag ggg aag ctc tac tgg gga gac gcc aag aca gac
Leu Asp Leu Gln Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp
510 515 520 525

aag atc gag gtg atc aat gtt gat ggg acg aag agg cgg acc ctc ctg 1693
Lys Ile Glu Val Ile Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu
530 535 540

gag gac aag ctc ccg cac att ttc ggg ttc acg ctg ctg ggg gac ttc 1741
Glu Asp Lys Leu Pro His Ile Phe Gly Phe Thr Leu Leu Gly Asp Phe
545 550 555

atc tac tgg act gac tgg cag cgc cgc agc atc gag cgg gtg cac aag 1789
Ile Tyr Trp Thr Asp Trp Gln Arg Arg Ser Ile Glu Arg Val His Lys
560 565 570

gtc aag gcc agc cgg gac gtc atc att gac cag ctg ccc gac ctg atg 1837
Val Lys Ala Ser Arg Asp Val Ile Ile Asp Gln Leu Pro Asp Leu Met
575 580 585

ggg ctc aaa gct gtg aat gtg gcc aag gtc gtc gga acc aac ccg tgt 1885
Gly Leu Lys Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys
590 595 600 605

gcg gac agg aac ggg ggg tgc agc cac ctg tgc ttc ttc aca ccc cac 1933
Ala Asp Arg Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His
610 615 620

gca acc cgg tgt ggc tgc ccc atc ggc ctg gag ctg ctg agt gac atg 1981
Ala Thr Arg Cys Gly Cys Pro Ile Gly Leu Glu Leu Leu Ser Asp Met
625 630 635

aag acc tgc atc gtg cct gag gcc ttc ttg gtc ttc acc agc aga gcc 2029
Lys Thr Cys Ile Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala
640 645 650

gcc atc cac agg atc tcc ctc gag acc aat aac aac gac gtg gcc atc 2077
Ala Ile His Arg Ile Ser Leu Glu Thr Asn Asn Asn Asp Val Ala Ile
655 660 665

ccg ctc acg ggc gtc aag gag gcc tca gcc ctg gac ttt gat gtg tcc 2125
Pro Leu Thr Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser
670 675 680 685

aac aac cac atc tac tgg aca gac gtc agc ctg aag acc atc agc cgc
Asn Asn His Ile Tyr Trp Thr Asp Val Ser Leu Lys Thr Ile Ser Arg
690 695 700

gcc ttc atg aac ggg agc tcg gtg gag cac gtg gtg gag ttt ggc ctt
Ala Phe Met Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu
705 710 715

gac tac ccc gag ggc atg gcc gtt gac tgg atg ggc aag aac ctc tac
Asp Tyr Pro Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr
720 725 730

tgg gcc gac act ggg acc aac aga atc gaa gtg gcg cgg ctg gac ggg
Trp Ala Asp Thr Gly Thr Asn Arg Ile Glu Val Ala Arg Leu Asp Gly
735 740 745

cag ttc cgg caa gtc ctc gtg tgg agg gac ttg gac aac ccg agg tcg
Gln Phe Arg Gln Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser
750 755 760 765

ctg gcc ctg gat ccc acc aag ggc tac atc tac tgg acc gag tgg ggc
Leu Ala Leu Asp Pro Thr Lys Gly Tyr Ile Tyr Trp Thr Glu Trp Gly
770 775 780

ggc aag ccg agg atc gtg cgg gcc ttc atg gac ggg acc aac tgc atg
Gly Lys Pro Arg Ile Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met
785 790 795

acg ctg gtg gac aag gtg ggc cgg gcc aac gac ctc acc att gac tac
Thr Leu Val Asp Lys Val Gly Arg Ala Asn Asp Leu Thr Ile Asp Tyr
800 805 810

gct gac cag cgc ctc tac tgg acc gac ctg gac acc aac atg atc gag
Ala Asp Gln Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met Ile Glu
815 820 825

tcg tcc aac atg ctg ggt cag gag cgg gtc gtg att gcc gac gat ctc
Ser Ser Asn Met Leu Gly Gln Glu Arg Val Val Ile Ala Asp Asp Leu
830 835 840 845

ccg cac ccg ttc ggt ctg acg cag tac agc gat tat atc tac tgg aca
Pro His Pro Phe Gly Leu Thr Gln Tyr Ser Asp Tyr Ile Tyr Trp Thr
850 855 860

gac tgg aat ctg cac agc att gag cgg gcc gac aag act agc ggc cgg
Asp Trp Asn Leu His Ser Ile Glu Arg Ala Asp Lys Thr Ser Gly Arg
865 870 875

aac cgc acc ctc atc cag ggc cac ctg gac ttc gtg atg gac atc ctg
Asn Arg Thr Leu Ile Gln Gly His Leu Asp Phe Val Met Asp Ile Leu
880 885 890

gtg ttc cac tcc tcc cgc cag gat ggc ctc aat gac tgt atg cac aac
Val Phe His Ser Ser Arg Gln Asp Gly Leu Asn Asp Cys Met His Asn
895 900 905

aac ggg cag tgt ggg cag ctg tgc ctt gcc atc ccc ggc ggc cac cgc
Asn Gly Gln Cys Gly Gln Leu Cys Leu Ala Ile Pro Gly Gly His Arg
910 915 920 925

tgc ggc tgc gcc tca cac tac acc ctg gac ccc agc agc cgc aac tgc
Cys Gly Cys Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys
930 935 940

agc ccg ccc acc acc ttc ttg ctg ttc agc cag aaa tct gcc atc agt
Ser Pro Pro Thr Thr Phe Leu Leu Phe Ser Gln Lys Ser Ala Ile Ser
945 950 955

cgg atg atc ccg gac gac cag cac agc ccg gat ctc atc ctg ccc ctg
Arg Met Ile Pro Asp Asp Gln His Ser Pro Asp Leu Ile Leu Pro Leu
960 965 970

cat gga ctg agg aac gtc aaa gcc atc gac tat gac cca ctg gac aag
His Gly Leu Arg Asn Val Lys Ala Ile Asp Tyr Asp Pro Leu Asp Lys
975 980 985

ttc atc tac tgg gtg gat ggg cgc cag aac atc aag cga gcc aag gac
Phe Ile Tyr Trp Val Asp Gly Arg Gln Asn Ile Lys Arg Ala Lys Asp
990 995 1000 1005

gac ggg acc cag ccc ttt gtt ttg acc tct ctg agc caa ggc caa aac
Asp Gly Thr Gln Pro Phe Val Leu Thr Ser Leu Ser Gln Gly Gln Asn
1010 1015 1020

cca gac agg cag ccc cac gac ctc agc atc gac atc tac agc cgg aca
Pro Asp Arg Gln Pro His Asp Leu Ser Ile Asp Ile Tyr Ser Arg Thr
1025 1030 1035

ctg ttc tgg acg tgc gag gcc acc aat acc atc aac gtc cac agg ctg
Leu Phe Trp Thr Cys Glu Ala Thr Asn Thr Ile Asn Val His Arg Leu
1040 1045 1050

agc ggg gaa gcc atg ggg gtg gtg ctg cgt ggg gac cgc gac aag ccc
Ser Gly Glu Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro
1055 1060 1065

agg gcc atc gtc gtc aac gcg gag cga ggg tac ctg tac ttc acc aac
Arg Ala Ile Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn
1070 1075 1080 1085

atg cag gac cgg gca gcc aag atc gaa cgc gca gcc ctg gac ggc acc
Met Gln Asp Arg Ala Ala Lys Ile Glu Arg Ala Ala Leu Asp Gly Thr
1090 1095 1100

gag cgc gag gtc ctc ttc acc acc ggc ctc atc cgc cct gtg gcc ctg
Glu Arg Glu Val Leu Phe Thr Thr Gly Leu Ile Arg Pro Val Ala Leu
1105 1110 1115

gtg gtg gac aac aca ctg ggc aag ctg ttc tgg gtg gac gcg gac ctg
Val Val Asp Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu
1120 1125 1130

aag cgc att gag agc tgt gac ctg tca ggg gcc aac cgc ctg acc ctg
Lys Arg Ile Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu
1135 1140 1145

gag gac gcc aac atc gtg cag cct ctg ggc ctg acc atc ctt ggc aag 3565
Glu Asp Ala Asn Ile Val Gln Pro Leu Gly Leu Thr Ile Leu Gly Lys
1150 1155 1160 1165

cat ctc tac tgg atc gac cgc cag cag cag atg atc gag cgt gtg gag 3613
His Leu Tyr Trp Ile Asp Arg Gln Gln Gln Met Ile Glu Arg Val Glu
1170 1175 1180

aag acc acc ggg gac aag cgg act cgc atc cag ggc cgt gtc gcc cac 3661
Lys Thr Thr Gly Asp Lys Arg Thr Arg Ile Gln Gly Arg Val Ala His
1185 1190 1195

ctc act ggc atc cat gca gtg gag gaa gtc agc ctg gag gag ttc tca 3709
Leu Thr Gly Ile His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser
1200 1205 1210

gcc cac cca tgt gcc cgt gac aat ggt ggc tgc tcc cac atc tgt att 3757
Ala His Pro Cys Ala Arg Asp Asn Gly Gly Cys Ser His Ile Cys Ile
1215 1220 1225

gcc aag ggt gat ggg aca cca cgg tgc tca tgc cca gtc cac ctc gtg 3805
Ala Lys Gly Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val
1230 1235 1240 1245

ctc ctg cag aac ctg ctg acc tgt gga gag ccg ccc acc tgc tcc ccg 3853
Leu Leu Gln Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro
1250 1255 1260

gac cag ttt gca tgt gcc aca ggg gag atc gac tgt atc ccc ggg gcc 3901
Asp Gln Phe Ala Cys Ala Thr Gly Glu Ile Asp Cys Ile Pro Gly Ala
1265 1270 1275

tgg cgc tgt gac ggc ttt ccc gag tgc gat gac cag agc gac gag gag 3949
Trp Arg Cys Asp Gly Phe Pro Glu Cys Asp Asp Gln Ser Asp Glu Glu
1280 1285 1290



ggc tgc ccc gtg tgc tcc gcc gcc cag ttc ccc tgc gcg cgg ggt cag 3997
Gly Cys Pro Val Cys Ser Ala Ala Gln Phe Pro Cys Ala Arg Gly Gln
1295 1300 1305

tgt gtg gac ctg cgc ctg cgc tgc gac ggc gag gca gac tgt cag gac 4045
Cys Val Asp Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gln Asp
1310 1315 1320 1325

cgc tca gac gag gtg gac tgt gac gcc atc tgc ctg ccc aac cag ttc 4093
Arg Ser Asp Glu Val Asp Cys Asp Ala Ile Cys Leu Pro Asn Gln Phe
1330 1335 1340

cgg tgt gcg agc ggc cag tgt gtc ctc atc aaa cag cag tgc gac tcc 4141
Arg Cys Ala Ser Gly Gln Cys Val Leu Ile Lys Gln Gln Cys Asp Ser
1345 1350 1355

ttc ccc gac tgt atc gac ggc tcc gac gag ctc atg tgt gaa atc acc 4189
Phe Pro Asp Cys Ile Asp Gly Ser Asp Glu Leu Met Cys Glu Ile Thr
1360 1365 1370

aag ccg ccc tca gac gac agc ccg gcc cac agc agt gcc atc ggg ccc 4237
Lys Pro Pro Ser Asp Asp Ser Pro Ala His Ser Ser Ala Ile Gly Pro
1375 1380 1385

gtc att ggc atc atc ctc tct ctc ttc gtc atg ggt ggt gtc tat ttt 4285
Val Ile Gly Ile Ile Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe
1390 1395 1400 1405

gtg tgc cag cgc gtg gtg tgc cag cgc tat gcg ggg gcc aac ggg ccc 4333
Val Cys Gln Arg Val Val Cys Gln Arg Tyr Ala Gly Ala Asn Gly Pro
1410 1415 1420

ttc ccg cac gag tat gtc agc ggg acc ccg cac gtg ccc ctc aat ttc 4381
Phe Pro His Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe
1425 1430 1435

ata gcc ccg ggc ggt tcc cag cat ggc ccc ttc aca ggc atc gca tgc 4429
Ile Ala Pro Gly Gly Ser Gln His Gly Pro Phe Thr Gly Ile Ala Cys
1440 1445 1450

gga aag tcc atg atg agc tcc gtg agc ctg atg ggg ggc cgg ggc ggg 4477
Gly Lys Ser Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly
1455 1460 1465

gtg ccc ctc tac gac cgg aac cac gtc aca ggg gcc tcg tcc agc agc 4525
Val Pro Leu Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser
1470 1475 1480 1485

tcg tcc agc acg aag gcc acg ctg tac ccg ccg atc ctg aac ccg ccg 4573
Ser Ser Ser Thr Lys Ala Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro
1490 1495 1500

ccc tcc ccg gcc acg gac ccc tcc ctg tac aac atg gac atg ttc tac 4621
Pro Ser Pro Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr
1505 1510 1515

tct tca aac att ccg gcc act gcg aga ccg tac agg ccc tac atc att 4669
Ser Ser Asn Ile Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr Ile Ile
1520 1525 1530

cga gga atg gcg ccc ccg acg acg ccc tgc agc acc gac gtg tgt gac 4717
Arg Gly Met Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp
1535 1540 1545

agc gac tac agc gcc agc cgc tgg aag gcc agc aag tac tac ctg gat 4765
Ser Asp Tyr Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp
1550 1555 1560 1565

ttg aac tcg gac tca gac ccc tat cca ccc cca ccc acg ccc cac agc 4813
Leu Asn Ser Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser
1570 1575 1580

cag tac ctg tcg gcg gag gac agc tgc ccg ccc tcg ccc gcc acc gag 4861
Gln Tyr Leu Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu
1585 1590 1595

agg agc tac ttc cat ctc ttc ccg ccc cct ccg tcc ccc tgc acg gac 4909
Arg Ser Tyr Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp
1600 1605 1610

tca tcc tgacctcggc cgggccactc tggcttctct gtgcccctgt aaatagtttt
Ser Ser
1615

aaatatgaac aaagaaaaaa atatatttta tgatttaaaa aataaatata attgggattt
taaaaacatg agaaatgtga actgtgatgg ggtgggcagg gctgggagaa ctttgtacag
tggagaaata tttataaact taattttgta aaaca

<210> 3
<211> 1615
<212> PRT 

<213> Homo sapiens 
<400> 3 

Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu Leu Leu Leu 
1 5 10 15 

Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala Ala Ala Ser 

20 25 30 

Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala 

35 40 45 

Gly Gly Val Lys Leu Glu Ser Thr Ile Val Val Ser Gly Leu Glu Asp
50 55 60

Ala Ala Ala Val Asp Phe Gln Phe Ser Lys Gly Ala Val Tyr Trp Thr
65 70 75 80

Asp Val Ser Glu Glu Ala Ile Lys Gln Thr Tyr Leu Asn Gln Thr Gly
85 90 95

Ala Ala Val Gln Asn Val Val Ile Ser Gly Leu Val Ser Pro Asp Gly
100 105 110

Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr Asp Ser Glu
115 120 125

Thr Asn Arg Ile Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val
130 135 140

Leu Phe Trp Gln Asp Leu Asp Gln Pro Lys Ala Ile Ala Leu Asp Pro
145 150 155 160

Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Gly Glu Thr Pro Arg Ile
165 170 175

Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys Ile Ile Val Asp Ser
180 185 190

Asp Ile Tyr Trp Pro Asn Gly Leu Thr Ile Asp Leu Glu Glu Gln Lys
195 200 205

Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe Ile His Arg Ala Asn Leu
210 215 220

Asp Gly Ser Phe Arg Gln Lys Val Val Glu Gly Ser Leu Thr His Pro
225 230 235 240

Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Trp Gln
245 250 255

Thr Arg Ser Ile His Ala Cys Asn Lys Arg Thr Gly Gly Lys Arg Lys
260 265 270

Glu Ile Leu Ser Ala Leu Tyr Ser Pro Met Asp Ile Gln Val Leu Ser
275 280 285

Gln Glu Arg Gln Pro Phe Phe His Thr Arg Cys Glu Glu Asp Asn Gly
290 295 300

Gly Trp Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro Phe Tyr Thr
305 310 315 320

Cys Ala Cys Pro Thr Gly Val Gln Met Gln Asp Asn Gly Arg Thr Cys
325 330 335

Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg Thr Asp Leu
340 345 350

Arg Arg Ile Ser Leu Asp Thr Pro Asp Phe Thr Asp Ile Val Leu Gln
355 360 365

Val Asp Asp Ile Arg His Ala Ile Ala Ile Asp Tyr Asp Pro Leu Glu
370 375 380

Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala Ile Arg Arg Ala
385 390 395 400

Tyr Leu Asp Gly Ser Gly Ala Gln Thr Leu Val Asn Thr Glu Ile Asn
405 410 415

Asp Pro Asp Gly Ile Ala Val Asp Trp Val Ala Arg Asn Leu Tyr Trp
420 425 430

Thr Asp Thr Gly Thr Asp Arg Ile Glu Val Thr Arg Leu Asn Gly Thr
435 440 445

Ser Arg Lys Ile Leu Val Ser Glu Asp Leu Asp Glu Pro Arg Ala Ile
450 455 460

Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp Trp Gly Glu
465 470 475 480

Asn Pro Lys Ile Glu Cys Ala Asn Leu Asp Gly Gln Glu Arg Arg Val
485 490 495

Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala Leu Asp Leu
500 505 510

Gln Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp Lys Ile Glu
515 520 525

Val Ile Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu Glu Asp Lys
530 535 540

Leu Pro His Ile Phe Gly Phe Thr Leu Leu Gly Asp Phe Ile Tyr Trp
545 550 555 560

Thr Asp Trp Gln Arg Arg Ser Ile Glu Arg Val His Lys Val Lys Ala
565 570 575

Ser Arg Asp Val Ile Ile Asp Gln Leu Pro Asp Leu Met Gly Leu Lys
580 585 590

Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys Ala Asp Arg
595 600 605

Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His Ala Thr Arg
610 615 620

Cys Gly Cys Pro Ile Gly Leu Glu Leu Leu Ser Asp Met Lys Thr Cys
625 630 635 640

Ile Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala Ala Ile His
645 650 655

Arg Ile Ser Leu Glu Thr Asn Asn Asn Asp Val Ala Ile Pro Leu Thr
660 665 670

Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser Asn Asn His
675 680 685

Ile Tyr Trp Thr Asp Val Ser Leu Lys Asn Ile Ser Arg Ala Phe Met
690 695 700

Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu Asp Tyr Pro
705 710 715 720

Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr Trp Ala Asp
725 730 735

Thr Gly Thr Asn Arg Ile Glu Val Ala Arg Leu Asp Gly Gln Phe Arg
740 745 750

Gln Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser Leu Ala Leu
755 760 765

Asp Pro Thr Lys Gly Tyr Ile Tyr Trp Thr Glu Trp Gly Gly Lys Pro
770 775 780

Arg Ile Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met Thr Leu Val
785 790 795 800

Asp Lys Val Gly Arg Ala Asn Asp Leu Thr Ile Asp Tyr Ala Asp Gln
805 810 815

Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met Ile Glu Ser Ser Asn
820 825 830

Met Leu Gly Gln Glu Arg Val Val Ile Ala Asp Asp Leu Pro His Pro
835 840 845

Phe Gly Leu Thr Gln Tyr Ser Asp Tyr Ile Tyr Trp Thr Asp Trp Asn
850 855 860

Leu His Ser Ile Glu Arg Ala Asp Lys Thr Ser Gly Arg Asn Arg Thr
865 870 875 880

Leu Ile Gln Gly His Leu Asp Phe Val Met Asp Ile Leu Val Phe His
885 890 895

Ser Ser Arg Gln Asp Gly Leu Asn Asp Cys Met His Asn Asn Gly Gln
900 905 910

Cys Gly Gln Leu Cys Leu Ala Ile Pro Gly Gly His Arg Cys Gly Cys
915 920 925

Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys Ser Pro Pro
930 935 940

Thr Thr Phe Leu Leu Phe Ser Gln Lys Ser Ala Ile Ser Arg Met Ile
945 950 955 960

Pro Asp Asp Gln His Ser Pro Asp Leu Ile Leu Pro Leu His Gly Leu
965 970 975

Arg Asn Val Lys Ala Ile Asp Tyr Asp Pro Leu Asp Lys Phe Ile Tyr
980 985 990

Trp Val Asp Gly Arg Gln Asn Ile Lys Arg Ala Lys Asp Asp Gly Thr
995 1000 1005

Gln Pro Phe Val Leu Thr Ser Leu Ser Gln Gly Gln Asn Pro Asp Arg
1010 1015 1020

Gln Pro His Asp Leu Ser Ile Asp Ile Tyr Ser Arg Thr Leu Phe Trp
1025 1030 1035 1040

Thr Cys Glu Ala Thr Asn Thr Ile Asn Val His Arg Leu Ser Gly Glu
1045 1050 1055

Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg Ala Ile
1060 1065 1070

Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met Gln Asp
1075 1080 1085

Arg Ala Ala Lys Ile Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu
1090 1095 1100

Val Leu Phe Thr Thr Gly Leu Ile Arg Pro Val Ala Leu Val Val Asp
1105 1110 1115 1120

Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg Ile
1125 1130 1135

Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala
1140 1145 1150

Asn Ile Val Gln Pro Leu Gly Leu Thr Ile Leu Gly Lys His Leu Tyr
1155 1160 1165

Trp Ile Asp Arg Gln Gln Gln Met Ile Glu Arg Val Glu Lys Thr Thr
1170 1175 1180

Gly Asp Lys Arg Thr Arg Ile Gln Gly Arg Val Ala His Leu Thr Gly
1185 1190 1195 1200

Ile His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro
1205 1210 1215

Cys Ala Arg Asp Asn Gly Gly Cys Ser His Ile Cys Ile Ala Lys Gly
1220 1225 1230

Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gln
1235 1240 1245

Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gln Phe
1250 1255 1260

Ala Cys Ala Thr Gly Glu Ile Asp Cys Ile Pro Gly Ala Trp Arg Cys
1265 1270 1275 1280

Asp Gly Phe Pro Glu Cys Asp Asp Gln Ser Asp Glu Glu Gly Cys Pro
1285 1290 1295

Val Cys Ser Ala Ala Gln Phe Pro Cys Ala Arg Gly Gln Cys Val Asp
1300 1305 1310

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gln Asp Arg Ser Asp
1315 1320 1325

Glu Val Asp Cys Asp Ala Ile Cys Leu Pro Asn Gln Phe Arg Cys Ala
1330 1335 1340

Ser Gly Gln Cys Val Leu Ile Lys Gln Gln Cys Asp Ser Phe Pro Asp
1345 1350 1355 1360

Cys Ile Asp Gly Ser Asp Glu Leu Met Cys Glu Ile Thr Lys Pro Pro
1365 1370 1375

Ser Asp Asp Ser Pro Ala His Ser Ser Ala Ile Gly Pro Val Ile Gly
1380 1385 1390

Ile Ile Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gln
1395 1400 1405

Arg Val Val Cys Gln Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His
1410 1415 1420

Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe Ile Ala Pro
1425 1430 1435 1440

Gly Gly Ser Gln His Gly Pro Phe Thr Gly Ile Ala Cys Gly Lys Ser
1445 1450 1455

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu
1460 1465 1470

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser
1475 1480 1485

Thr Lys Ala Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro Pro Ser Pro
1490 1495 1500

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn
1505 1510 1515 1520

Ile Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr Ile Ile Arg Gly Met
1525 1530 1535

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr
1540 1545 1550

Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser
1555 1560 1565

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gln Tyr Leu
1570 1575 1580

Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr
1585 1590 1595 1600

Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser
1605 1610 1615

<210> 4 
<211> 1615 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Glu Ala Ala Pro Pro Gly Pro Pro Trp Pro Leu Leu Leu Leu Leu 

1 5 10 15 

Leu Leu Leu Leu Ala Leu Cys Gly Cys Pro Ala Pro Ala Ala Ala Ser 

20 25 30 

Pro Leu Leu Leu Phe Ala Asn Arg Arg Asp Val Arg Leu Val Asp Ala 

35 40 45 

Gly Gly Val Lys Leu Glu Ser Thr Ile Val Val Ser Gly Leu Glu Asp
50 55 60

Ala Ala Ala Val Asp Phe Gln Phe Ser Lys Gly Ala Val Tyr Trp Thr
65 70 75 80

Asp Val Ser Glu Glu Ala Ile Lys Gln Thr Tyr Leu Asn Gln Thr Gly
85 90 95

Ala Ala Val Gln Asn Val Val Ile Ser Gly Leu Val Ser Pro Asp Gly
100 105 110

Leu Ala Cys Asp Trp Val Gly Lys Lys Leu Tyr Trp Thr Asp Ser Glu
115 120 125

Thr Asn Arg Ile Glu Val Ala Asn Leu Asn Gly Thr Ser Arg Lys Val
130 135 140

Leu Phe Trp Gln Asp Leu Asp Gln Pro Lys Ala Ile Ala Leu Asp Pro
145 150 155 160

Ala His Gly Tyr Met Tyr Trp Thr Asp Trp Val Glu Thr Pro Arg Ile
165 170 175

Glu Arg Ala Gly Met Asp Gly Ser Thr Arg Lys Ile Ile Val Asp Ser
180 185 190

Asp Ile Tyr Trp Pro Asn Gly Leu Thr Ile Asp Leu Glu Glu Gln Lys
195 200 205

Leu Tyr Trp Ala Asp Ala Lys Leu Ser Phe Ile His Arg Ala Asn Leu
210 215 220

Asp Gly Ser Phe Arg Gln Lys Val Val Glu Gly Ser Leu Thr His Pro
225 230 235 240

Phe Ala Leu Thr Leu Ser Gly Asp Thr Leu Tyr Trp Thr Asp Trp Gln
245 250 255

Thr Arg Ser Ile His Ala Cys Asn Lys Arg Thr Gly Gly Lys Arg Lys
260 265 270

Glu Ile Leu Ser Ala Leu Tyr Ser Pro Met Asp Ile Gln Val Leu Ser
275 280 285

Gln Glu Arg Gln Pro Phe Phe His Thr Arg Cys Glu Glu Asp Asn Gly
290 295 300

Gly Trp Ser His Leu Cys Leu Leu Ser Pro Ser Glu Pro Phe Tyr Thr
305 310 315 320

Cys Ala Cys Pro Thr Gly Val Gln Met Gln Asp Asn Gly Arg Thr Cys
325 330 335

Lys Ala Gly Ala Glu Glu Val Leu Leu Leu Ala Arg Arg Thr Asp Leu
340 345 350

Arg Arg Ile Ser Leu Asp Thr Pro Asp Phe Thr Asp Ile Val Leu Gln
355 360 365

Val Asp Asp Ile Arg His Ala Ile Ala Ile Asp Tyr Asp Pro Leu Glu
370 375 380

Gly Tyr Val Tyr Trp Thr Asp Asp Glu Val Arg Ala Ile Arg Arg Ala
385 390 395 400

Tyr Leu Asp Gly Ser Gly Ala Gln Thr Leu Val Asn Thr Glu Ile Asn
405 410 415

Asp Pro Asp Gly Ile Ala Val Asp Trp Val Ala Arg Asn Leu Tyr Trp
420 425 430

Thr Asp Thr Gly Thr Asp Arg Ile Glu Val Thr Arg Leu Asn Gly Thr
435 440 445

Ser Arg Lys Ile Leu Val Ser Glu Asp Leu Asp Glu Pro Arg Ala Ile
450 455 460

Ala Leu His Pro Val Met Gly Leu Met Tyr Trp Thr Asp Trp Gly Glu
465 470 475 480

Asn Pro Lys Ile Glu Cys Ala Asn Leu Asp Gly Gln Glu Arg Arg Val
485 490 495

Leu Val Asn Ala Ser Leu Gly Trp Pro Asn Gly Leu Ala Leu Asp Leu
500 505 510

Gln Glu Gly Lys Leu Tyr Trp Gly Asp Ala Lys Thr Asp Lys Ile Glu
515 520 525

Val Ile Asn Val Asp Gly Thr Lys Arg Arg Thr Leu Leu Glu Asp Lys
530 535 540

Leu Pro His Ile Phe Gly Phe Thr Leu Leu Gly Asp Phe Ile Tyr Trp
545 550 555 560

Thr Asp Trp Gln Arg Arg Ser Ile Glu Arg Val His Lys Val Lys Ala
565 570 575

Ser Arg Asp Val Ile Ile Asp Gln Leu Pro Asp Leu Met Gly Leu Lys
580 585 590

Ala Val Asn Val Ala Lys Val Val Gly Thr Asn Pro Cys Ala Asp Arg
595 600 605

Asn Gly Gly Cys Ser His Leu Cys Phe Phe Thr Pro His Ala Thr Arg
610 615 620

Cys Gly Cys Pro Ile Gly Leu Glu Leu Leu Ser Asp Met Lys Thr Cys
625 630 635 640

Ile Val Pro Glu Ala Phe Leu Val Phe Thr Ser Arg Ala Ala Ile His
645 650 655

Arg Ile Ser Leu Glu Thr Asn Asn Asn Asp Val Ala Ile Pro Leu Thr
660 665 670

Gly Val Lys Glu Ala Ser Ala Leu Asp Phe Asp Val Ser Asn Asn His
675 680 685

Ile Tyr Trp Thr Asp Val Ser Leu Lys Asn Ile Ser Arg Ala Phe Met
690 695 700

Asn Gly Ser Ser Val Glu His Val Val Glu Phe Gly Leu Asp Tyr Pro
705 710 715 720

Glu Gly Met Ala Val Asp Trp Met Gly Lys Asn Leu Tyr Trp Ala Asp
725 730 735

Thr Gly Thr Asn Arg Ile Glu Val Ala Arg Leu Asp Gly Gln Phe Arg
740 745 750

Gln Val Leu Val Trp Arg Asp Leu Asp Asn Pro Arg Ser Leu Ala Leu
755 760 765

Asp Pro Thr Lys Gly Tyr Ile Tyr Trp Thr Glu Trp Gly Gly Lys Pro
770 775 780

Arg Ile Val Arg Ala Phe Met Asp Gly Thr Asn Cys Met Thr Leu Val
785 790 795 800

Asp Lys Val Gly Arg Ala Asn Asp Leu Thr Ile Asp Tyr Ala Asp Gln
805 810 815

Arg Leu Tyr Trp Thr Asp Leu Asp Thr Asn Met Ile Glu Ser Ser Asn
820 825 830

Met Leu Gly Gln Glu Arg Val Val Ile Ala Asp Asp Leu Pro His Pro
835 840 845

Phe Gly Leu Thr Gln Tyr Ser Asp Tyr Ile Tyr Trp Thr Asp Trp Asn
850 855 860

Leu His Ser Ile Glu Arg Ala Asp Lys Thr Ser Gly Arg Asn Arg Thr
865 870 875 880

Leu Ile Gln Gly His Leu Asp Phe Val Met Asp Ile Leu Val Phe His
885 890 895

Ser Ser Arg Gln Asp Gly Leu Asn Asp Cys Met His Asn Asn Gly Gln
900 905 910

Cys Gly Gln Leu Cys Leu Ala Ile Pro Gly Gly His Arg Cys Gly Cys
915 920 925

Ala Ser His Tyr Thr Leu Asp Pro Ser Ser Arg Asn Cys Ser Pro Pro
930 935 940

Thr Thr Phe Leu Leu Phe Ser Gln Lys Ser Ala Ile Ser Arg Met Ile
945 950 955 960

Pro Asp Asp Gln His Ser Pro Asp Leu Ile Leu Pro Leu His Gly Leu
965 970 975

Arg Asn Val Lys Ala Ile Asp Tyr Asp Pro Leu Asp Lys Phe Ile Tyr
980 985 990

Trp Val Asp Gly Arg Gln Asn Ile Lys Arg Ala Lys Asp Asp Gly Thr
995 1000 1005

Gln Pro Phe Val Leu Thr Ser Leu Ser Gln Gly Gln Asn Pro Asp Arg
1010 1015 1020

Gln Pro His Asp Leu Ser Ile Asp Ile Tyr Ser Arg Thr Leu Phe Trp
1025 1030 1035 1040

Thr Cys Glu Ala Thr Asn Thr Ile Asn Val His Arg Leu Ser Gly Glu
1045 1050 1055

Ala Met Gly Val Val Leu Arg Gly Asp Arg Asp Lys Pro Arg Ala Ile
1060 1065 1070

Val Val Asn Ala Glu Arg Gly Tyr Leu Tyr Phe Thr Asn Met Gln Asp
1075 1080 1085

Arg Ala Ala Lys Ile Glu Arg Ala Ala Leu Asp Gly Thr Glu Arg Glu
1090 1095 1100

Val Leu Phe Thr Thr Gly Leu Ile Arg Pro Val Ala Leu Val Val Asp
1105 1110 1115 1120

Asn Thr Leu Gly Lys Leu Phe Trp Val Asp Ala Asp Leu Lys Arg Ile
1125 1130 1135

Glu Ser Cys Asp Leu Ser Gly Ala Asn Arg Leu Thr Leu Glu Asp Ala
1140 1145 1150

Asn Ile Val Gln Pro Leu Gly Leu Thr Ile Leu Gly Lys His Leu Tyr
1155 1160 1165

Trp Ile Asp Arg Gln Gln Gln Met Ile Glu Arg Val Glu Lys Thr Thr
1170 1175 1180

Gly Asp Lys Arg Thr Arg Ile Gln Gly Arg Val Ala His Leu Thr Gly
1185 1190 1195 1200

Ile His Ala Val Glu Glu Val Ser Leu Glu Glu Phe Ser Ala His Pro
1205 1210 1215

Cys Ala Arg Asp Asn Gly Gly Cys Ser His Ile Cys Ile Ala Lys Gly
1220 1225 1230

Asp Gly Thr Pro Arg Cys Ser Cys Pro Val His Leu Val Leu Leu Gln
1235 1240 1245

Asn Leu Leu Thr Cys Gly Glu Pro Pro Thr Cys Ser Pro Asp Gln Phe
1250 1255 1260

Ala Cys Ala Thr Gly Glu Ile Asp Cys Ile Pro Gly Ala Trp Arg Cys
1265 1270 1275 1280

Asp Gly Phe Pro Glu Cys Asp Asp Gln Ser Asp Glu Glu Gly Cys Pro
1285 1290 1295

Val Cys Ser Ala Ala Gln Phe Pro Cys Ala Arg Gly Gln Cys Val Asp
1300 1305 1310

Leu Arg Leu Arg Cys Asp Gly Glu Ala Asp Cys Gln Asp Arg Ser Asp
1315 1320 1325

Glu Val Asp Cys Asp Ala Ile Cys Leu Pro Asn Gln Phe Arg Cys Ala
1330 1335 1340

Ser Gly Gln Cys Val Leu Ile Lys Gln Gln Cys Asp Ser Phe Pro Asp
1345 1350 1355 1360

Cys Ile Asp Gly Ser Asp Glu Leu Met Cys Glu Ile Thr Lys Pro Pro
1365 1370 1375

Ser Asp Asp Ser Pro Ala His Ser Ser Ala Ile Gly Pro Val Ile Gly
1380 1385 1390

Ile Ile Leu Ser Leu Phe Val Met Gly Gly Val Tyr Phe Val Cys Gln
1395 1400 1405

Arg Val Val Cys Gln Arg Tyr Ala Gly Ala Asn Gly Pro Phe Pro His
1410 1415 1420

Glu Tyr Val Ser Gly Thr Pro His Val Pro Leu Asn Phe Ile Ala Pro
1425 1430 1435 1440

Gly Gly Ser Gln His Gly Pro Phe Thr Gly Ile Ala Cys Gly Lys Ser
1445 1450 1455

Met Met Ser Ser Val Ser Leu Met Gly Gly Arg Gly Gly Val Pro Leu
1460 1465 1470

Tyr Asp Arg Asn His Val Thr Gly Ala Ser Ser Ser Ser Ser Ser Ser
1475 1480 1485

Thr Lys Ala Thr Leu Tyr Pro Pro Ile Leu Asn Pro Pro Pro Ser Pro
1490 1495 1500

Ala Thr Asp Pro Ser Leu Tyr Asn Met Asp Met Phe Tyr Ser Ser Asn
1505 1510 1515 1520

Ile Pro Ala Thr Ala Arg Pro Tyr Arg Pro Tyr Ile Ile Arg Gly Met
1525 1530 1535

Ala Pro Pro Thr Thr Pro Cys Ser Thr Asp Val Cys Asp Ser Asp Tyr
1540 1545 1550

Ser Ala Ser Arg Trp Lys Ala Ser Lys Tyr Tyr Leu Asp Leu Asn Ser
1555 1560 1565

Asp Ser Asp Pro Tyr Pro Pro Pro Pro Thr Pro His Ser Gln Tyr Leu
1570 1575 1580

Ser Ala Glu Asp Ser Cys Pro Pro Ser Pro Ala Thr Glu Arg Ser Tyr
1585 1590 1595 1600

Phe His Leu Phe Pro Pro Pro Pro Ser Pro Cys Thr Asp Ser Ser
1605 1610 1615



<210> 5 

<211> 3096 

<212> DNA 

<213> Homo sapiens 
catcttctca cacgatctct cgcttcgcac tccttccttt gattggtttt caccatttac 60
tcagacgacg gtccttcttc gatctttgca cattcttcta tcatctacta ccttcatacc 120
cagctccgtc ccctaatatt catgcgcgga tggcccattc cgtggtgaaa attcccttct 180
actctgctaa tctgctgttc tctctccctc ccgtcgggtt ctgctcctgc cacgttctcc 240
cctctcccca ccaaaggctg ggttttcttt gtcagggctc ctttcccctt tggaagaagg 300
ggggctgtat ggccttggtg cgaggccctc cagtgacagg atcccccatc acccagagtt 360
ccacaggccc tggtagggag gagggggagc agaagaggag gtgccatctt tgcctgctgg 420
ggaagggcag gggccaccca cacagagctc tcccatttgc tgtggaccct ggggccactg 480
cccagttcct tccaaaggaa agccagctcc ccaggtggtg ggagagtgat atggcttcct 540
cttaaactta gggaattgag tgtgtggttg cttctaagtg ccttagaagc cgggagcggc 600
tcctggaaag agcctgcctg ccacagcggg ccttaccctg gctgtgccca cagatgtccc 660
tggggcctgc cgctcctgcc cggctctcct ggcctccccc ggtgtgggtt gggaaaagca 720
cagcaaatta aaaaacacct ccatctctgg cctttgaaga atgcatctga acagccgaga 780
gtgtaaaccg tggtgaaatg tggtctttcc agtttgggga gaagcagggc agagctgggg 840
cttttgtacc cagggtttcc aagagctcct gcctccctcg gctgggctgg ccagggcccc 900
ccgctgggac ctccagctgt aatagggaag gttttactgg gttgctggcc actgtggact 960
gcccctaagg gcaggtatgc ctgcctttac ccgggttccc ctcctgcctg gaagatacag 1020
cccatgggag gcctgttgtc tgtgggatcc tccagcatca gagacactgg ggccagcgtc 1080
tgcctggtga ggtgcaggcc tggcaggccc ggtcccccac ctgcttgagc acccacggtg 1140
gtgggggctc gctgcctccc gagacaatct atgtcattgt tgtccaagga agctaattta 1200
gagtagaaag ttccgtgtcc agtcccactc tgtgcgtgtg ttagcagggg actctcgggc 1260
cggagctggg tccaccctgg tagggggact tcatggggcc tgggcgacag cactgtgtat 1320
ttgtgtgtgt gtgtgtttgt gtgtgtgtgt gtctgaggag gtggaccagt ttctcaaaag 1380
gcctgtgacc ccaagaacca aggaatttca gcctgggtgg atcacacctt cactggtgag 1440
tgggacaagc tgggggccct cgccacagga gcagccaggg catggggcac agttggcctc 1500
attcacaaaa tgggagtata agtgatccct gctctggcgg ccaggacgat gagtgggaac 1560
acaccgtgtg ggggctgcct ggcctgggtg tgccgcgggt gtccttgttg gtgatggttc 1620
cacctgcttg tgccaccagt gccctctggg tctcacacac aactctcttc ccagcgaagg 1680
cccctcctgc cctcaggcct cagtgctgct tccgtctcgg aaggccccag gagctcctgc 1740
atcctgggcg tgattcctgt gtgcctgcag accccctcgc ggctgccatc tcatcctttg 1800
gtgcacctgt tggccagacc tcctggtagc gggtgctgca ctcccctgaa tgtgccgggg 1860
cctgggggca gggacctggg ctcctccctc actgagtgga gggaactcag tgtcttggag 1920
ttggggtgcc tgcaggctgg gtggtgcagg tgaaatgcag acctctcagc tggtgttcca 1980
gagcagctgc cttcccccgc ccgagggact tcacccgcag cccagtcagg ggtggcgcct 2040
gggtgcatcg cccgcaggct gggtaggggt ggagcctggg tggccctgcc tgtgagctgc 2100
atagttgtcg cctttgaccc tgagttttct tcgttatctg tttggacctg tttggggcag 2160
gcaggggatg agatctgaag ataaatgcct tagctgtgac catctccttt tgtgagaggt 2220
caatgtccag ttccgctgca gttataacat cccatttttt gatttctttt tattttttcc 2280
tttttctttt tgagatggag tctcgctctg tcacccaggc tggagtgcaa tggggtgacc 2340
tcagctcact gcaacctcca cttctcgggt tcaagtgatt ctcctgcctc agcctcctga 2400
ctagcagggg ttacaggcgt gagccaccac gcccagctaa tttttgtatt tttagtagag 2460
gcaaggtttc gtcatgttgg ccaggctggt ctcaaactcc tggccttaag tgatctgccc 2520
gcctcggcct cccaaagtgc tgagatgaca ggtgtgagcc accgtgcccg gcccagaact 2580
CCuUaaCt.CC CaCvtyaaaL i-^y^^y^^ 1 ' 1 ' ccagtctccc tcccctagtc 2640
cctggtccca ccattctgct ttctgtctca atgaatttgc ctaccgtaag tacctcatat 2700
aaattgaatc ataaagtatt tgtcttttta tatctggctt atttcactta gcataacatt 2760
cttaagtttc atccatgttg tagcatgtgt cagaatctct ctcttttttt tttttttttt 2820
tttttttttt ttttgcagac agagtctcgc tctgtcatct agactggagt tcagtggcac 2880
gatctcggtt cactgcaaca tctgcctcct gggtccaagc aattctcctg cctcagcctc 2940
cttagcagct ggaactacag gcgcgtgcca ccatgccttg ctaatttttg tatttttatg 3000
tggaggcagg gtttcaccat cttggccagg ctggtctcga attcctggtc ttcaccacgg 3060
gggcccgaag gacccgggca aagcgtggag gggagg 3096



<210> 6 

<211> 26928 

<212> DNA 

<213> Homo sapiens 
<220>
<221> unsure
<222> (12044), (12489), (26433), (26434), (26435), (26436), (26439), (26441)
<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 6 



rraanarpaart rrr*f-a r a raac aaaCfCaCfttt CaqqqCqqqC aqCCtqqqgC 
y ctciy cid— ddy . 33 detect v_o.yv- vwi j33 w 33j w 3 u —> 


cccacggggc 


60 


n arft-at"fctc acctatQQaq qqcaqaqaaq qqaacaggga 


gagaagtggc 


120 


^ggctgggag tggagguggg ttugagyuLL. ucto uy uctaa^ uaoaty^ua 


ccctctacct 


180 


tagttatgaa ttatgagaca cgaaya.ct.yt- yctctctuctyctuct v_a.^^k_^u.^v_«. 


aaagtgcctc 


240 


taggctgaca gggagaaagt cccgccaggc tcccagacgc cacctttgag tccttcaaca 300
agcccgccag ggcctcttgc ccaccggtgt cagctcagcc actgaaccct ccaggaagaa 360
gacgtgctgg taggagaaga atctcaccca ggcacagcct ggaaggggca cagaaggggc 420
tccggaacca gcaagcccaa gttggaactc ccagtctgct actttctaga acgactgtgc 480
ccttggcggg tctaagtaga acctctccgc gcactctttc ctcctttgta aagtggggac 540
agcaatggcc accttgcagg ttcagagagg gcttgcagta cctcacagaa ctgagtgccc 600
gtgaacgtgt gtgttcctcc agatttgtga cagctttgcc aggctggagt caggctgaac 660
gcctctgccc tcatggggtt tatattctag gaagaccaac aaaaacaaga agacggaaaa 720
ttaaaacaac aaaagcccca ttgacaggcc gtgaagaatg ccatgaaaaa tgaatggcgt 780
tgtgctgcag tctttgggga aacgggctta cggaaagaag gacacttgag ctgctaccaa 840
tgagcagccg tccggtggga gggcagttca ggaagagcag acatccactg aggaggcgct 900
ggggcagagg gcagcctggt cgctggattc gggggaggaa ccacatcagg ccatgagctg 960
gagctggtgg tagaatgtac aggagaggcc agccagggcc agctcatgtc agacctcaag 1020
cggggaagat gaatcgagaa tgcaccccac gagcaatggg aagccagtct acgatttaag 1080
cagcaaaaat attttccctt cttccaccct gcatccagct ctaccagcac agcctggggt 1140
tctattttca agatagaata gacccagact cccagctctt cttacacttc tactactgcc 1200
acctgtcacc cactcatgcg tccccacttg cagcctcgac ccccttccac ctgatctcat 1260
ggcagccagg gaagctccag ggctcgtgag ggctgccatc tcaggaaaga agcaaaagcc 1320
ttcggcacct gcagggcctg ctccaaccac acttcttcct tgacctctca gcttccttag 1380
ccactccctt cccacatctc accctgctcc agccacagtg gtgtctctgt gggttctcaa 1440
acacaccagg tgcactcctg cctcagggcc tttgtgcttg ctgttctctg ctgggactct 1500
tttttttttt tttttttttg agacagggtc tcactctgtg gcccaggctg gagtgtagtg 1560
gtgtgatcgt agctcattgc aacctcaaac tcctgggctc aagcaatcct cccacctcag 1620
cctctcaagt agttagcttt tgttgttttg ttttgagatg ggatctcact ctgttgccca 1680
ggctggagtg cagtggggca atcttggctc accacaacct ctgcctccca ggctcaagca 1740
attctcctgc ctcagcctcc caagtagctg ggattacagg catgtgccac cacgcccagc 1800
ttatttttgt atttttagta gagacagggt ttcaccatgt tggtctggct ggtcttgaac 1860
tcctggcctc agatgatcca cctgcctcgg cctcccaaag tgctgggatg acaggcatga 1920
gcctgtctct agtagttagg actacagaga ggggccatca tgcctggtga tcctcccacc 1980




caactctttc accccactta gcctcgtggc tcactctctt 


acctcttcag 


2040 


U L L 'w .3 


aggcctgagg acccctgttg aaaattgcaa accacacccc 


ccaccaccac 


2100 




tgccagcact ttctactcca tttctctgct ttacttttct 


cctttgtact 


2160 


oat'r'pr'rsrr 


tgactcatta catgtttacg tatctttctt ctctccacta 


gcatggaagc 


2220 


\~ r*Hzs etna nan 
u ^ \^ cty y ciy ay 


cagagagtgt agttttattc cctgatgtgt ttcctgtgcc 


cgtaccaggg 


2280 


cctagcacac agtaggtgct cagtaaatgt gtgttggatg aacaaataca gtgaaaggat 2340
ctgatctaca tttataaaga aggcactctg gctgctgagt ggggatgaga ctgtcaggag 2400
gaaagaggcc cctgtggggg cctggccagc aggtgggtac aatggtagca gccaggagag 2460
agggcctctt ggactcaagt ggatggggcc tgctcagggc tccggccaca ggaacaaagg 2520
gaagggggcc caggatggcc tgtcatagag gacacattac aactggccca aagttcaagt 2580
caggtttcta aatttgggaa gggatacaga aaaactaaag actctactgg acagtcagtt 2640
attgaaatga ttacatagaa aatgtaccaa gaattaaaaa aaaaaaaaaa aagcattatg 2700
aaggggccac cagagactcc cagagaggaa agggactatg ggctggatgc ggtgactcac 2760
acctataatc ccagcacttt gggaggccga ggagggtgga tcacgaggtc aggagttcaa 2820
aaccagccta ggcaacatgg taaaaccccc gtttctacta aaaatacaaa aaattagctg 2880
ggcatggcag catgtgcctg taatcccagc tactcgggag gctgaggcag gagagttgct 2940
agaacccagg aggcagaggt tgcagtgagc cgagattgag ccactatgct ccagcttggg 3000
cgacagagca agactccgtc tctaaaaaaa agaaaaaaaa ggccagatga ggtggctcat 3060
gcctgtaatc ccagcacttt gggaggccga ggtgggtgga tcacgaggtc aggagatcga 3120
gaccatcctg gctaacatgg tgaaactcca tctctactta aaatacaaaa aattagccgg 3180

gcgtggtggc gggcacctgt agtcccagct acttgggagg ctgaggcagg agaatggcgt 3240 

gaacctggga ggcggagctt gcagtgagcc gagattgcgc cactgcactc catccagcct 3300 

gggcgacaga gttagactcc gtctcaaaaa aaaaaaaaaa aaaaaaatta gctgattagt 3360

tgggcttggt ggcgggcgcc tgtaatccca actactcggg aggctgaggc gggagaatca 3420 

cttgaacccg ggaggcagag gttgcaatga gccgatatca cgccactaca ctccagcctg 3480 

ggcgacagag caagactcca tctcaaaaaa gaaaaaaaaa aagaaagggg ctgtgctgtg 3540 

gcctgggacc caaagcacac tactgcaagg tcccagggtg cctgactcca accggagcct 3600 

tgagaacatt catttgcaaa gaatgaatta aaattcagca ctattttatt ctgcaggatt 3660 

ccagcacccc aaggacagtc atttttagac ccttcagtaa cgtaataagt aaccggagga 3720

tgtgctgagc ttccacttcc ccagacggtt gcctgtcaca gctcatcagg ccaacaaact 3780 

tttcttaggc ctcaaatttg gaaatgttca ctctcagttc gttccttaga tgcaagtcca 3840 

tcccaatgaa gtaacagggg ctcagcacct gtccaatctc attgcttccg gggacagggg 3900 

cccatgagga tgtcgtttca gcccggtgac acttgggcaa agtgcctttt ggtttccctc 3960 

ccaggctgga acgtgctggc tctgtgaagt tacgctgggc acaagagccc cccccaaccc 4020

ggcaggactg actgctgtgg tcagaggcgc ccctggggct ttgggagcca cagaatcttc 4080 

ctgagggcag cgccggagga ggccccagtg agagtgccca ctgccaggct cattcctcag 4140 

gctgccgcag gcctctcccc aaaacaggca atgcttctca gcaacctgcc ccaggagcag 4200 

gccagggaag gccgccatcg gcctacagtg ctgggctctg gagggcttgg ttggtaacag 4260 

gccatggttt ctatgagcca gctggggtgt gaaggacaca ggctggattc acctctctgg 4320 

gcctcagttt ctgcattcaa aaagtgggaa tcatgatatc tgctctattt cttatctctc 4380 

agtgctgatg tgaacctcca ataagacttt taaaaatact ctttctacct tacttttatt 4440 
tttcatttat tttaagataa tgtctagctg tctcacccag gctggagtgc agtggtgtga 4500 
ttacggctca ctacagcctt aacctcccag gctcaagtga tcctcctacc acagcctccc 4560 
aagtagctgg aactacaggc atgcaccacc gcacctggat aattttttct tttgagacaa 4620
ggtttcactc tgttgcccag gctggagtgc agtggtgcac tcttggctca ctgcagcctc 4680 
aacctccctg ggcttaggtg atcctcacac ttcagtctcc caagtagctg ggactacagg 4740 
tatgtgccag tacacccagc taatattttt gaaggatggg gtttcactat attgcccagg 4800 
ctggtcttga actccagggt ttaagcaatc taccttcctc agcctgccaa agtgctagga 4860 
ttataggtat gagccacccc ccggcctata atcctaccac tttaaaaaag cctgtaattt 4920
tagcacttta aaaaattttt ctaaattttt tatagagatg ggggacagct gtggtctcac 4980
tgtgttgccc aggctggtct tgaactccta ggatcaagcc atcctcctgg cctggcctcc 5040
caaagtgttg ggattataag cataagcctt accttacctt ttttttttga gttgcagttt 5100
tgttcttgtt gctcaggctg gagtgcaatg gcaagatctt ggctcactgc aacctccacc 5160
tcccgggttc aagcaattct cctgcctcag cctcccgagt agctgggatt acaggcatgc 5220
gccaccacac ccagctaatt ttgtattttt agtagagatg gggtttctct atatacctta 5280
attttaaagc actgcattca tgtaaattgt gattaacatg gattcaagag agggagtgag 5340
gatgaatgag ccaggcagtc acctcggctg tcaccctcca cttctctcct ccttctgaca 5400
gtcatcgtcc atccgtttct gcagctgttt gtttgactct cctgatcatt ttgcttgcca 5460
cataacttgc ctcctgggaa agaatgccct gggcaggccc acatgagtag tgaaaaataa 5520
tctgcagtga aaaataaaac taagtagtct ggtccacaga gcagtcttat tttttcactg 5580
cagatgaagg agttgacatt caggcttcat tctcatttat aagtgtttta aagacacata 5640
cagtggattg aacagtggcc ttcaaaaaga tgtatctaca tcctaatccc tgggacctgt 5700
gaatgttaac caagttagga aaagggtctt cccgggtgtc attaagttag agatcttgag 5760
atgaggagct catcgtggat tatccaggtg gaccctgcat ccaaggacaa atggtcctta 5820
gaaaagaaaa gcagaggctg ggcacagtgg ctcaagcctg taatcccagc actttgagag 5880
gccgaggtgg gtggatcacc taaggtcatg agttcgagag cagcctggcc aacatgatga 5940
aatcccatct ctactaaaaa tacaaaaatt agcaaggcat ggtggcgggt gcctataatc 6000
ccagctactc aggaagctga ggcaggagaa tggcttgcac ctgggaggcg gaggttgcag 6060
tgagccaaga tcgcgccact gcactccagc ctgagggaga aaagtgaaac tctgtctcat 6120
aaaagaaaag aaaagcagac agagatctga gacagaagag gagagtgaag gaaaaaaggc 6180
catgtgaaga tgaggcagag gttggagcca tgcagccaca agccaaggaa tacctggagc 6240
cccagaagtt gcaagaggta ggaagaagcc tcccctagag cctccagacg gagcacagcc 6300
ctgccaacac ctccacctca gacttctggc ctccagcact gtgagataat caactgctgt 6360
tgttttaagc caccagattt gtggtaattt gttatggcag ccacaggaaa ctaatacagt 6420
acctaatctt cacaaaccca tcttacagaa aaggaaactg aagtcagaga ggtagtggct 6480
tgtgcagtgt gttaggccat tcttgtatta ctataaagaa atacctgagg ccgggcatgg 6540
tggctcacgc ctgtaatccc agcactttgg gaggccaagg tgagtggatc acttgaggtc 6600
aggagttcaa gaccagcctg gacaacatgg tgaaacccca tttctactga aaatatgaaa 6660
attagccagg catggtggcg tgcatctgta gtcccagcta ctcaggaggc tgaggcagga 6720
gaatcacttg cgcccgggag gaggaggttg tagtgagcca agattgtgcc actgcactcc 6780
agcctgggag acaagagaga aaccctgtct caaaataaat aaaaaacaaa taaacacctg 6840
agactgggta gtttataaag aaaggggtta actggctccc ggttctgcag gctgtacaag 6900
catggtgccg gcatctgctt ggttgctggg aaggcttcag ggagttttac tcatcgtgga 6960
aggcagagcc agagcaggtg catcacacag caaaagcagg agcgagagag agagagagca 7020
gggaggtgtg cacactttta aatgagcaga tctcacgaga actcaccatt gcaaggacag 7080
caccaagcca cgaggggtct gcccccatga cccaaacctc ccactaggcc ccacccccaa 7140
cattgggaat tacagttcaa catgaggttt ggggggacaa atatccaaac tatatcattc 7200
cacccctggc cccccagatc tcatgttctt ctcacattgc aaaatatagt catgccttcc 7260
cagtagcccc ccaaagtctt aactcatccc agcattaact caaaaatccc attcccaagt 7320
ccaacgtctc atctgaagat gagttccttt cacctacaag actgtaaaaa tgaaaacagt 7380
tatttactgc tgagatacaa tgggggcata ggcattaggt aaacattcct gttccaaaag 7440
ggagaaatcg gtcaaaagaa aggggctata ggccccaagc aagtccaaaa cccagcagag 7500
caatcattca atcttaaagc tccaaaataa cctccttaaa ctccatgtcc catagccagg 7560
gcacactggt gcaaggggca ggctcccaag gccttgggca gctctattcc tgcggctttg 7620
cagaattcag tccccatggc tgctcttaca gattggagat gagggcctgc ggcttttcca 7680
ggtgcagggt gcaagctgct ggtgatctac cattctgggg tgtggatggt ggcggccccg 7740
tcccgcagct ccactaggca ttgtcccagt ggggactcta tgtggggcct ccaaccccac 7800
atttcccctc caatgggaag gctctgcccc tgcagcagcc ttcttcctgg gctcccaggc 7860
tttctcatac atcctctgac atctaggtgg atggtgtcaa gcttccttca ctcttgcact 7920
ctgcacacct acaggcttaa caccacatgg aagctgccaa ggtgtatggc tggaaccctc 7980
tgaagcagca gcctgagctg tgactatggc cctttgagcc aaggctggag ctggaacagt 8040
ctagatgcag gcagggagca gtgtcctgag gctgtgcaga gcagcagggc cctgtgcctg 8100
gacaatgaaa ccattctttc ctcctcatcc tctgggcctg tgatgggagg gttgtggaag 8160
atctctgaaa tgcctttgag gcctttttgc ctctgaggcc tatttcctat tgtctcagtt 8220
attggcagtc ggctcctttt tagttatgca aatcctctag caagaggtta ctccactgcc 8280
ggcttgaact cctctcctga aaaagctttt tctttctttg tcacatggcc aggctgcaaa 8340
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8400 



ttttccaaac ttttatgctc tgttttacct ttaaatataa cttctaactt taattcattt
atttgctcct gcatttgagc atagggaatt caaagaagct gggccacatc ttgaatgctt 8460
tgctgcttca aaatttatgg ccacgcttgg tggctcacac ctgtaatccc agcactttgg 8520
gaggcctagg tgggcagatc acgagatcag gagatcgaga ccatcctggt caacatggtg 8580
aaacccatct ctactaaaaa tacaaaaaaa ttagcttggt gtggtggcgc agacctgtag 8640
tcccagctac tggagaggct gaggcaggag aattacttga acctgggagg cagaggttgc 8700
agtgagccca gatcatgcca ctgcactcca gcctggtgac agaataagat ttgatctcga 8760
aaggaaggaa ggaaggagga agggaagaaa tgtcttcccc ccagatgtcc tgggtcatcc 8820
ctcttatgtt caaacttcaa cagatcccta gggcatgaaa ataatacagc caaattattt 8880
gctaaggcat aacgaaagtg acctttgctc cagttcccaa taagttcctc atttccatct 8940
gagactcatc accctggcct tggcttgtcc atatcactgt cagcattttg gtcacaatca 9000
tttaaccagc taatcgggag gctgaggcaa gaggatcact tgaacccagg aggttgaggc 9060
tgcagtgagc tgtgatcaca tcactgcagt ccagcttggg caacagagca agatcctgtc 9120
tcaataaata aataaataaa tacataaata acttaagttt atttaaagct gcatctttgc 9180
caccatggag aaaggccagg ccagctcctt ctctctttct gcacgtgttc ctcccacctc 9240
agctgcctct gctcctcaag gaggaacaga gggagtagga aaggccatcc caggaggccc 9300
agcaccccaf gacctggctc tggggccttg tgggtttatg gattcccagt gctgagtcat 9360
ccctcacagg ctcttgtggg caccttggac attggtcaga agcatgtggt ccccgggaac 9420
acaccttttc ctgatcatct gggaagggca gcttgtgcca gcgaggccac ctgttcagcg 9480
ccacggcccg ccagacagct gcagccacag ccttgccttt gatcagagca aacaccagac 9540
atgtgtgtca tgcccccaac ccatctccag gggacacatg tcctttcttg ccaggcctga 9600
gatgaacaag agagggacaa gtccccaagc ctctctctcc ttcctgcctc acccactccg 9660
ctgttagatt ctcaaggtgg atggtgggct aactagggca accgaccatc ctggtttacc 9720
tagaactgag ggggcatttt caggaataaa actgcaaaag tctggagcaa acaggagcaa 9780
gttggtcact ctggggctgg tggagtcagg tttccttctg caggccccct ccccgcaagc 9840
atgggtggaa cccaggacag gaacacagag caggccccag gaccgggctt gtcacttaca 9900
agtctttttt tttttttttt ttttgagatg gagtcttgct ctgtcatcag ggctggagta 9960
cagtggtgcc atcttagctc actgcaacct ctgccttctg ggttcaagtg atccccctgc 10020
ctcagcctcc tgagtagctg ggactacagg tggcaccacc acgcccagct aattttttgt 10080
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atttctagta gagatgagat ggccaggctg gtcttgaact cctgacctca agtgatctgc 10140 



ccgccttggc ctcccaaagt gctgggatta caggtgtgag ccactgtgcc tggccccact 10200

cacaagtctt aaaccatgcc tcagcacatc aatgccattt acaaaaaggt agagggattt 10260

tccaggcaaa aatagatgaa agacatagga tgattgatca tgtcctgctt aaacataggt 10320 

ctgatgctat taagaattga gggctgggag cggtggctca cgcctgtaat cccagcactt 10380 

tgggaggccg aggcgggcgg atcacgaggt caggagatcg agaccatcct ggctaacacg 10440 

gtgaaacccc atctctacta aaaatacaaa aaatggccgc gcgcggtgac tcacgcctgt 10500 

aatcccagca ctttgggagg ccaaggcggg cggatcacga ggtcaggaga tcgagaccat 10560 

cctggctaac acagtgaagc cccgtctcta ctaaaaaata caaaaaaaat tagccaggca 10620 

tggtggcggg cgcctgtagt cccagcaact tgggaggctg aggcaggaga agaatggtgt 10680 

gaacctggga ggtggagctt ccagtgagcc gagatcacac cactgcactc cagcctgggc 10740 

gacagagtga aactccatct caaaaaaaaa ataaataaat aaataagaat tgttagtatt 10800 

ttgcaggtgt gacaaatgat tctgtttctg tggcagaatg ttctcaggag atctcttttg 10860 

aactctcatg gaaagcatca tgctgttggc aacatcacat ttatttttat ttatttatta 10920 

ttttttagag acagggtctt gctctgttgc ccaggctgga gtgcagtggc acaatcacag 10980 

ctcactgcag cctcaacctc ctgggctcaa gcaatcctcc tgcctcagcc tcccaaagta 11040 

gctgggacca caggcgtgag ccactgcact cagcccaatg taccttcaat atttacattt 11100 

ctggcaaagg tagcaaaacc ttaacaaatt ttgaatctag ataataaaat tatgaggctg 11160 

ggtgcagtgg ccctgacagg gatggctcac atctgtaatc tcaacatttt gggaggccaa 11220

ggtaggcgga tcacctgagg ccaggagttt gagaccagcc tggccaacat ggtgtaaccc 11280 

tgtctctaac aaaaatacaa aaaaattagc cagacgtggt ggtgcacgtc tgtcatccca 11340 

gctactaggg aggctgaggc aggagaattg cttgaacccg agaggcagag gttgtgatga 11400 

gccgagatcg cgtcattgca ctccagcctg ggcaaaagca agagcgaaac tctctctcca 11460 

aaaaataaaa aaaaaataaa ttaatgaatt aattaaaata aaataaaata atggatagtc 11520 
actgtaaaga aaaaataaat gtatatatca gccaacaagt gatggaatag agcaccccat 11580
ctccctggct ggacagatac atcccacaac acctggaagg cggctccatg tagaactttc 11640

tggactgctt gaggtgctgt gctggagcac ggtgacagag gagctggacc atggacctcc 11700

cccggccccc accaagggcg aggtccccct gtggtgggtc tgagggaggc atccgtatgg 11760

cctctgcggc ttgggcaggg aatttggggt ccaagtactt ggtgcaaagc ctggaaagag 11820
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ggtttgggtg ctgagggcat atcccctggg ccacatgggg gcagaagtgg ggccccctga 11880 

agcttggagt cctgggcagg ggcatctatt ttgctgtctg aggccttcag tacttgaagc 11940 

aaaatggagg cagaatgtcc caccttaatg cccctgattc ctccaaacca attccagaga 12000

cagcaagggc cagaacaggg atggccctgc ccagggtcat gcancgagga agtggccagg 12060 

ctgggatctg aacccaggct aatcccctcc cttgtcctcc tccaggccct cacccctgca 12120

tagagccctc cagctcactc atcctcggcc agctccatct cctcagcttg taaacccccc 12180 

cgggattttc ctttcttaaa aaacaaaggc ttggccaggc acggtggctc acgcctgtac 12240 

tttgggggtg gctcccagca ctttgggagg ccaaggtggg cggatcatga ggtcaagaga 12300

ttgagaccat tctggccagc atggtgaaac cctgtattta ctaaaaaaaa aaaaattaac 12360 

tgggcatggt ggctagctac ttaggaggct gaggcaggag aatcgcttga acctgggaga 12420

aagaggttgc agtgagccaa gatcgcgcca ctccacttta acctggcaac agaacaagat 12480 

tccgtttcna aaaacaaaca aacaaacaaa taaacaaaaa aaggcggagc gcgatggctc 12540

gcgcctgcaa tcccagcact ttgggaggct gaggcgggcg gatcacttga ggttaggagt 12600 

ttgagaccag cttggccaac atggtgaaac cccatttcca ctaaaagtac aaaaatcagc 12660 

caggtgtggt ggtgggtgcc tgtaatccca gctactcagg aggctgaggc aggagaatcg 12720

cttgaaccca tgacctggag gctacagtga gctgagattg cgccactgta ctccagcttg 12780 

ggcaacaaga tttgtttctc taaaaaaaaa aaaaaaaaga ctggcccttc cccttcagct 12840 

cttcctcagg gtccctgagc actctacacc cccgtctaca ctgagcactc caccctgctg 12900 

tctacactga gcactccacc ctgccatcta cactgaggac tccaccccac tgtctacact 12960 

ggctgcctcc cgccctcacc tcctgctaag gccattcccc gctgcatctg tcttctagat 13020 

tctgcagcct tcagcacgct gggcccctcc tttgtcccct tgagccacct ccagcctccc 13080

cctgagctgc tactcctctc ccagcagcct ccacccaagc ccctccagtc cccaagctgt 13140 

cccttgcatc cagcactgcc cttccacgtg ccccttccct ccagcttcac agcagggtgg 13200 

ggcctccagg ccctgcccac tgtgcccatc cacaagttgt ggtgggagct ccgaggggag 13260

gcaggggtgt gcatggactt gggacgtcca agtctgggac caggggcagc tggttggtgg 13320

agtgtggagg gggataggga ctttcaggta gagaggctgt aggggcaaga tcgggacggc 13380 

ggatgtccct aaggagggct ctgacctggg aaatattgtg cagcttcctc tttgccattc 13440

ctggagctca gacactggcc ggctctcacc ccgcccttcc tgcaggacac agctccatcc 13500 

cagtgagttc ctagtgtaga catctccagc agcacggatg ggaaaggaag tcatcaaagg 13560 
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tgcccaggac cggaggcttt ttctggaggt ggcagaggag ggtgtgggtc tcagggctct 13620 

ggctgagggc aagcgtggga ggtcttaggt ctgcaccagc cccgtgaagg cccctcctgc 13680

tccctggtgg agtcctagag ggaacagcag cccctaggct ctagcaggag tgggtagggg 13740 

cttttctggc ttcctactgt gccagcagga tagctgggcc tggcactgag cccaaagatc 13800

acatgccggg gcattggcgc agtgaggaac agacccttgc caaagctggc aaagaagacc 13860

ccatggggtg cagctggtga agctgagagc tcaatgtttg ggggagcctg gcaaaagggg 13920 

tcctcccctc cctctgcagg ccaggatcgc aggttttccc tacatgttgg taattctcaa 13980 

acaatcccat ggccactgga gcaaagatca cagtgggcgg cggcctcggg agcagtggac 14040 

agggcacgca gtgcctttga tgccagagcc ctcgccccaa agtcaacaaa ctctgcagcg 14100 

gactttgcac ccggactttg ttttcaccat acaaggaaag ggacagatca caggccctct 14160 

cgctgccctc gctgagccgg aagctgcagc gtgagctctc tcaagcccca tttctaggtt 14220 

ccccaggcgc acccctgagc ccctactcgc ctattaagtt ctcctaafag cccttcaagg 14280

tcttaatgta tgtccattag acagagggga aaactgaggc gagggcaagt gacttgaccg 14340

aggttcctcg gcgagcaggg cgtggagctg agaacctcgt tattactgct ccccacacaa 14400 

ccctctggcc gttcttggaa gaaggctgag ccccgggggg gccagagtga cccaaacacc 14460 

atgggccgcc tgcggtaaca cgtgcggcca cgaaggggca gcagtttccc gcccggccgg 14520 

gctctctccg gcgctcagta tccgtcccag gccaagaaga agaaactcgg ggaggagggc 14580 

ggagggggct gcgtgggagg gcgtggaaga tggacgtggc caggggagtg gcagctgcac 14640 

acagtggatg ctgttaagat gaagggaaag aacgtgggct ccgagatcac tggacacggt 14700 

tccacctttc ttcccgctca ctgcatggcc ctgggcgggt tgttgaaccc ttggaaacct 14760 

gtttttcctt ttttcctttt tttttgagac agggtcttgc tctgtggccc agactggagt 14820

gccgtggcac gatcttggct cactgctgcc tcccaggttc aagtgatcct cccagctcag 14880 

cctcctgcgt agctgggacc ccaggtatgt gtcaccacag ccggctaatt tttgtatttt 14940

tttgtagaga cgggatttcg ccgtattgcc caggctggtc tcaaactcct gagttcaccg 15000 

gatcttcctg cctcagcctc ccaaagtgct gggattactg gcatgagcca ccgcacccag 15060 

cagagacctc agttttctaa cctgtgccag caggaataat gatagctgcc tagcttggct 15120 

gtgctgggaa ttaagtaaga tgaccgggta gcaaatatga agtattactg gacacagagg 15180 

gccccaggct gggttagcag cggtggtcag ggctgctgct tcctggcctg agctcgaagg 15240 

agggccctca ttaccacctg ggtgagtcct cgtccaagcc tggcactgct gcgtgggaat 15300 
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aacttctgcc . acccaagttg gcagattgtg tgcaaagtta agtcctgact ctgtggggtg 


15360 


gacttcgagg cctcttcatc ggacctgctt ccggtgactg 


cattcgcacc 


tcctcctgtt 


15420 


cctggtttaa cacagcccag ctttcctcct gctgagccct ccctgggcct gctgtcaccc 


15480 


tcqtqccqct gtgcctcgca gtgccactcc ctgtaccctg 


aatactttgc 


cctgcctctc 


15540 


cacccagctg agagtcaggg cccctgtgag gctctgccca 


gcccgtcctc 


cgggtttctg 


15600 


cctctgctga gcacttccct gcatgattgc ttctgagagt 


ccccccagcc 


tgtgagcttc 


15660 


tcaggactgg gacagcttct caggaccgag gcttcctggt 


ctgcttgcaa 


ttttacaggc 


15720 


gggcacattt tcccttggcc aacatcagag actggacatc 


tgcagatctg 


tgctagccac 


15780 


tgagcaccca ggcaccccag caggtagctc tgtaaccaac 


ccattctgta 


aagctgaggc 


15840 


tcagagaggt gaagcgcctg gcctggggcc acagcctgcg 


tcagctgcag 


agccaggagc 


15900 


tgagatatgc acctgcggct ctgctcacag ggtcctgcac 


agactgctgc 


tggagccacc 


15960 


tatgtagagt caagagagtt catgttaact ccctctcaca 


tccctcagcc 


agggtggggg 


16020 


ctgacgatag acactcaggg atggcctacc ctccccaaca 


acccccgtca 


ggtttgccgg 


16080 


atctccttgg aagaaaagtt ctgggcagaa ttccaccgtt 


ggcctggcct 


acactctcct 


16140 


tagtggctta ggaccctcag cggtggataa gttgtgggca gaagagatgc 


aatcaggatt 


16200 


ctcacccact caccccttgc cagccccaat aagctcaata 


agctgggctc 


ggtctgagga 


16260 


agtgtccagg aaatgtgcaa atggcctggg acagccctgt 


gttcctttca 


gtaaggttgc 


16320 


tgaaggtgag gctgaaagtt ggagaaacag aagccagtgc 


ttatggtttt 


aattaagata 


16380 


atggaatgta tgtatgtatg tatgtatgta tgtatgtatt 


tatgtattta 


tctttagaga 


16440 


tagagtctca ctctgttgcc caggctggaa tgcggtgaca 


caatcatagc 


tccttgcagc 


16500 


ctcgacttcc tatgcccaaa tgatcctcct acctcagcct cctgagtagc tgggactaca


16560 


gacacacgcc aactatgcct agctaatttt tatttctatt 


ttttgtggag 


actgggttct 


16620 


cactttgttg cccaggctgg tcttgaaccc ctagcttcaa 


gcaatcctcc 


tgcctcagcc 


16680 


tcccaaagtg gagggattac aggtgtgagc caccacacct ggcctggaat 


ttatttgtat 


16740 


tctgcttata aaattaatac attcttattg cagaaaagtt 


tgaaaataaa 


agaaaggaca 


16800 


aagaacaaaa agcgtatata atttcacagc tcagatctca ctgctattaa 


catttttatt 


16860


tactttcagg cttttttctt tctaggtaca tatgcagaga 


ttattttatt 


ttatttattt 


16920 


tattttatat tttattttat attttttatt tcattatttt 


attttatttt 


attttattat 


16980 


ttttagagac agggcctcac tctgtcaccc aggctggagt 


acaatggagt 


gatcatagct 


17040 
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cactgcagcc 


tcaaacacct gggctcaagc 


aatcccccca 


ctcagccttc 


tgagtagttg


17100 


ggactaaagt gtgagtctgg ctaatttttt ttactttttg tattgacaga ggtctcacta 


17160


tgttgcccag 


gctgatctca aactcctggg 


ttcaagcgat 


cctcccacct 


tggactccca 


17220 


aagtgctggg attacaggca tgagccacca tgcctggcct 


aaaatgccac 


tttttgtcat 


17280


ttactaaaat 


cccatggaca ctttgacatg 


tctgtattct 


atgctattga 


tctgactgtt 


17340 


ggcatctaca 


tcattatggc catctatcat 


ctatcataat 


ccattttaac 


attaaaattg 


17400 


tgctgctgct 


tagatttttc tggcctgtct 


cctatttgta 


ttcttccaga 


taaattttag 


17460 


aatcatttta 


tcaaattccc cttgcagaaa 


aagccctatt ggattttggt 


tgaaaaatac 


17520 


tgaattttta cattaactta ggaaagggct gggcacggtg gctcacgcct gtaatcccta 


17580 


cacttttcga 


ggccaaggca ggtggatcac 


ttgaggttgg 


gagtttgaga 


ccagcctggc 


17640 


caacatggtg aaactcggtc tttactaaaa 


atacaaaaat 


tgccaggcgc 


attggctcac 


17700 


ctgtaatccc 


agcactttgg gaggccgagg 


tgggtggatc 


acgaggtcag gagatagaga 


17760 


ccatcctggc 


taacacggtg caaccccgtc 


tctcctaaaa 


atacaaaaaa 


ttagccaggc


17820 


gtggtggtgg gcgcctgtgg tctcagctac ttaggaggct gaggcaggag 


aatggtgtga 


17880 


acccaggagg 


cggagcttgc agtgagccaa gatcgcgcca 


ctgcactcca gcctgggcga 


17940 


cagagtgaga 


ctccatctca aaaaaaaata 


ataataataa 


tacaaaaatt 


agccgggggt 


18000 


cgtggcgtgc 


acctataatc ccagttactt gggaggctga ggcaggagaa tcgcttgaat 


18060 


ccaggaggtg 


gaggttgcaa tgagcagaga 


tcgtgccact 


gtactccagc 


ctgggtgaca 


18120 


gagtgacact 


ctgtgaaaaa aaaaaaaaaa 


ttctgaagga 


ttgagactct 


tagactctta 


18180 


ggtcttccta 


tccaagagca caatatagct 


tttcatgtat 


tcaagccttt 


ttcaatgcat 


18240 


caacagaatt 


ttacagtttt tttcatgata 


tcctgctatt 


tcttataaaa 


tgtattccta 


18300 


gatattctgc 


atgttttccg gttgtttgtt 


aataaatatt 


tttcatttgt 


cattatttcc 


18360 


taattggctg 


ttatttgtat atatgacatc 


tgttgaattt 


tttgattact 


ttgaaaatgg 


18420 


ccattctttt 


gtgttttttt ttaactttct 


attttgagat 


aattttgact 


tacagaagat 


18480 


ttgcaaaaat 


agtacagaga gttcctgttt 


cccccttatg 


ttaacccagt 


ttctccttat 


18540 


gttaacatct 


tacataacta cagaacaatt


gtcaaatcta 


agaatcaacc 


tgggcacaat 


18600 


gctattaact 


aaactgcaga agctgttcag 


atctcaccag 


ttcttctact 


gctccccttt 


18660 


tctcttccag 


tgttcaatcc ggaatcctac 


attatattta gttgtcattt 


ctctttggtg 


18720 


tcttccaatc 


tgtgacagtt cctcagtctt 


tctttgtctt 


tcatgacttt 


cattttttta 


18780 
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tacttttgaa 


aaatactggc 


cggttgtttt 


gtagaacgcc 


ctcagtttgg gtttgcctga


18840 


agttttttgt 


gattagatcg 


aggtcatgca 


ttattggaga 


gggtgccacc 


gcctcgatgt 


18900 


gcaagctcaa 


tgcatcatat 


cagagggttt gtaatgtcag tttataccgc 


cggagaccct 


18960 


aacctggagc 


atttcgtgaa 


ggtgctgtct 


gccaggattc 


tccactagaa 


agttactatt 


19020 


tttccctttt 


taattactga 


atgtctgagg ggaaatactt 


tgagactatg 


caaatatcct 


19080 


gtttctgctt 


taacttcggc 


tcactaagtt 


tagcattcat 


ctatggatct 


cgcttatagc 


19140 


aagtattact 


gtggagttct 


aatggtaatt 


ttctgtttct 


ctcattcctt 


caacctttat


19200 


taatatgctt 


cttcctcact 


tattcatttt 


gtttcagttg 


tttataccaa 


catggatttg 


19260 


tggatattgg 


ttttattctt 


tgggttgcaa 


ttgaatccta 


tcattatttt 


gttagtcagt 


19320 


tgttccatcc gaccttggtc 


attaggagcc 


cttgaaattt 


ggctcccatg 


cctttttttt 


19380 


tttttttgag 


accgagtctc 


actctgtcac 


ccaggtttga 


gtgcagtggc 


atgatcttgg 


19440 


cttcctgcaa 


cctccgcctc 


ccaggttcaa 


gcaattctcc 


tgcctcagcc 


tcctgagtag 


19500 


ctggtattat 


aggcgctcca 


ccaccttgcc 


cggctaattt 


tttgtatttt 


tagtagagat 


19560 


ggggttttat 


tatgttggcc 


aggctggtct 


caaactcctg 


acctcaggtg 


atctgcccgc 


19620 


ctcggcctcc 


caaagtgctg 


ggactacagg 


cgtgagccac 


cacacctggc 


ctcctatgcc 


19680 


attttaacat 


gcccgtcttt 


tctttttctt 


tcctactttc 


tgtgactgta 


agaagctcca 


19740 


ggatacattt 


ttgctgccct 


agacttagcc 


tcaatcagtt 


ctcagaaaag 


ctctggttct 


19800 


ttttatggga 


tacttagaaa 


actagctctg 


tatggcctgg 


cgcggtggct 


cacgcctgta 


19860 


atcccagtac 


tttgggaggc 


cgaggtgggc 


agatcacaga 


tcacgaagtc 


aggagatcaa


19920 


gaccatcctg 


gctaacatgg 


tgaaactctg 


tctctactaa 


acatacaaaa 


aattagtcca 


19980 


ggcgcggtgg 


cgggcgcctg 


tagtcccagc 


tactcaggag 


gctgaggcag 


gagaacggca 


20040 


tgaacccggg 


aggcggagct 


tgcagtgagc 


cgagatcggc 


agccactgca 


ctccagcctg 


20100 


ggccacagag


cgagactccg 


tctcaaaaaa aaaaaaagga aaaagaaaaa agaaaactag 


20160 


ctctgtatgc 


tagttttttt 


tttaagacag 


ggtctctctt 


gccccagctg gagtgtagca 


20220 


gcacgatcac 


agctcactgt 


agcctcaacc 


ttctgggctc 


aagcaatcct 


cctgcctcag 


20280 


tctcctaagt 


agctgggtct 


acaggcatgc 


accaccgtac 


gtggcaattt 


ttaaaaactg 


20340 


tttgtagaga 


tggagtctcc 


ctatgttgcc 


tggtctggaa 


ctcctggcct 


caagtgatcc 


20400 


tcctgcctcg gcctcccaaa 


gtgctgagat 


tacaggcatg agccactgta 


cctggcctgg 


20460 


ccaaggtctg 


tcttttttta 


aaagaagttg 


ttgtatagtt 


gttttttttt 


ttattttttt 


20520 
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ttctgagacg gagtctcgct ctgtcgccca ggctggagtg cagtggtgcg atctcggctc 


20580 


actgcaagct ccgcctccca 


ggttcacgcc 


attctcctgc 


ctcagcctcc cgagtagctg 


20640 


ggcctacagg cgcccgctac 


cacgcccggc 


taattttttg 


catttttagt agagacgggg 


20700 


tttcaccgtg ttagccagga 


tggtctcgat 


ctcctgacct 


cgtgatccgc ccgcctcggc 


20760 


ctcccaaagt gctgggatta 


caggcgtgag 


ccaccgcgcc 


cggcctgttg tatagttttt 


20820 


atctcgagtt ttctagcgat 


ttaatcatat 


tggttacaaa 


aaaggatgat tttactacct 


20880 


cctttccaat gtttctacat 


attttttcat 


tttatctaac 


tgcattttaa aataaacttt 


20940 


taattttaga atggtttcat 


atttacagaa 


aatgtgcaaa 


gatagtacag agagttcctg 


21000 


tgtactccac acccggtttc 


cttattatta 


tcttaacgtg 


atacacaatt aataaaccag 


21060 


taacattatt attcactgaa gtccacactt 


tctttttttt 


tttttctgag acggagtcta 


21120 


cttctgtcac ccaggctgga 


gtgcagtggc 


gcaatctcgg 


ctcactgcaa cctccacctc 


21180 


ctgggttcag gcaattctgt 


ggctcagcat 


cccaagtagc 


tgggaataca ggtgcccgcc 


21240 


accacgcccg gctaattttt 


tgtattttta 


gtagagatgg 


ggtttcacca tgttagccag 


21300 


gatggtcttg aactcctgac 


ctcgtgatct 


gcctgcctca 


gcctcccaaa gtgctgggat 


21360 


tacaggcgtg agccaccgcg 


cccggcgtcc 


atactttctt 


tagatatcct tcctttttac 


21420 


ctaacgtcct tcttctggtt 


caggatccca 


tccagaaagc 


aacattaccc ctcgccatca 


21480


cgtcttcaca ggctcccctt 


gacgggaaga 


gttcctcaga 


ctttccttgt ttttgttgac 


21540 


cttgacagtt ttgaggagga 


ctggtatctt 


agtctgtttt 


gtgctgctat cacagactag 


21600 


ctgagaccga tacatgatac 


atgaaaaaaa atgtattctt acagttgtgg aggctgggaa 


21660 


gttcaagacg aagttgctgg 


ttggtttggt 


ctctggtttc 


aagatggcgc cttgctgctg 


21720 


catcctctgg agaagaagaa tgcggtgtcc 


tctcactgca gaagatggaa gcgctaaaag 


21780 


gaatgaactc cctttgccaa gccattttat 


aatgggcatt 


aatccacaaa ggatgaaacc 


21840 


ctgagaaaca tcaagcttta 


aagcactggt 


tctcaacctt 


tttggtctca ggagcccttt 


21900 


atactcttaa aacgttttga 


ggatcccaaa 


aaaaggcttc 


tacaggttcc atcttttaat 


21960 


atttaccata tcaaaaatta 


aactgaaaaa 


attttaaatt 


atttattcat ttaaaataac 


22020 


aaggataaac ccattacatg 


ctaacataaa 


tcatgtattt 


tatgaaaaat agctatattt 


22080 


atcaaaacaa aaattagtga 


gaagagtggc 


atgtataatt 


ttttttgttt attttttgtt 


22140 


tttagatgga atcttattct gtcgcccagg ctggagtgca gtggtgtgat ctcggctcac 


22200 


tgcaagctct gcctcccagg 


ttcacaccat 


tctcctgcct 


cagcctcctg agtagctggg 


22260 
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actgcaggtg 


cctgccacca 


cgcccggcta 


attttttgta 


tttttagtag agatggagtt 


22320 


tcaccgtgtt agccaggatg gtcttgatct


cctgaccttg 


tgatccaccc 


gcctcagcct 


22380 


cccaaagtgc 


tgggattaca 


ggcttgagcc 


actqcqtctq 


gcctaaattt 


ttgtgaatgt 


22440 


ctttaatgcc 


tgccttctca 


tatttgtttc 


tgcattcaag 


ttattgcaaa 


atgttgtgtt 


22500 


ggttgaagtt 


tgtaaagaaa 


atgtggcctc 


atacagttgt 


gtagttggaa 


aggcaagagt


22560 


attttgattc 


tctcttcaaa 


caactatgga 


caacctgctg 


ttacaaaacc 


agaatgcaaa 


22620 


aagttgtagt 


aaatacaggt taggtgtagt gtggaatctg aaagcatgtg aatgaacttt 


22680 


ctgagttttg 


taacattaaa 


gtccagttgc 


gttaagctac 


tgtgatagca 


tatagcattg 


22740 


tcctaatact ggaattagta tcagaagtgg ggtgctactg 


ttaataaata 


aaaagaaata 


22800 


aataaatcat 


gtgatactgg 


ctcagaagtc 


aggcagtagg 


ctgtgtggaa 


cctgacatca 


22860 


cgccatgtaa 


tacattggca 


accatttgat 


ccagctgtct 


gtcatgatga cttggaaagt 


22920 


caaccacata 


cttacagagc 


ctgtagacat 


aggggaaaat 


agtataaaac 


agaatactaa 


22980 


cagtggacct 


tggttcttgc 


cagttgcatt 


tagccaaata 


ttaaacaaaa gagatattct 


23040 


tgggcagcaa 


ctggaccatc 


ttcaagtaaa 


agtgaaaggt 


aataaacaga gtccagacat 


23100 


ttgtgcccat 


gcgggttaag 


aaaaatccag 


ttgcttctag 


acaccgtata 


tgaaaacaac 


23160 


gctgaaaaca 


agcctttgag 


tggtaaaggc 


cgattaacac 


tcagcgcggt 


aacaaagacc 


23220 


aggtgggcta 


acccgaaatg 


aaatgagaag 


cctgtggtga 


tgaggaggca 


gagaagtaaa 


23280 


atcaagtttg 


agcatttcgt 


ttaggagagt 


ttgggctctg 


attacttgca 


catgcaaacg 


23340 


aactggaaac 


aaacagatca 


gatgtctacc 


acttcttcga gggaattgca 


ttgccaaaga 


23400 


agtcatgaaa 


gcagactcta 


tactgattag 


gcattaaaac 


aaaaacaatc 


tttaggcccc 


23460 


taaacttgca 


"tgggcaggaa 


gtgggctgtc 


aaagctgttc 


atcctctaag 


gtggacctag 


23520 


ttcctagtcc 


ccagtataca 


cttcagatgt 


ggccctggag gacactggac 


atggaggacc 


23580 


tcccagagga 


tgaggctagg 


gcttcatttc 


tccaatgacc 


tcagctgcct 


ctatttcccc 


23640 


ttcttcctct ggaagtccta tcatcgttat 


tattattatt 


atcatcattt 


ttattttgag 


23700 


ataaggtctc 


gctctgttgc 


ccaggctgga gtgcagtgac 


atgatcatgg 


ctcactgcag 


23760 


ccctcccagg 


ctcaagtgat 


cctcctgcct 


cagcctcctg 


agtagctggg 


agtacaggca 


23820 


catgccacca 


tgcttggcta 


tttttttttt 


cagtagagat 


agggctctca 


ctatgttgcc 


23880 


agggctgatc 


tcaacctcct gggttcaaga gatcctccta cctcagctcc 


tgagtagctg 


23940 


ggattcgggt 


gcacaccacc 


atgccaacta 


atttttaatt 


tttttttgta 


tggacaggat 


24000 
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rYT*ar ,, arrf~rT'h*t* 

y LdL.dy Ly L L 


arraaah aaa t 
cty ctctci uyyaL 


tacttGcacra 


ooc acfaaacra 


tcacttgagc 


ccaggagttt 


24060 


rra4-|'tar , ar* , 'f~ PF 

y d l LctucKw Ly 


Ly ctdvL^a Ly a . 


tcgcacccct 


gcactccaat 


ctgggcaaca 


aaqtcraqacc 


24120 


l. Ly lv l Lddd 


ciaaaciciuacici 


aaaacracracfa 


gagagagact 


caaagatagg 


caaaaaagtg 


24180 


y y cicicty luul 


a-i-a rrf- rrrra pa 
ct Lay Ly y q-^ci 


a aaaaaaaca 


ctctaagtct 


accctattaa 


catggtgctg 


24240 


aaggtgggcu 


aau L dy dy ct L 


dyyyyy *— a v u. 


a tat crcrt taa 


ctatgggtgc 


atctttggct 


24300 


ttccctgggt 


gaucccaagu 


Ly y ddy t-cty y 


yd^dCtd-CLCLU- L- 


a crcrcf a a cr p t a 
dyy y ctcty ^ t-y 


ttacittattc 


24360 


at cacgt tct 




gac uyy tcy u 


y ct l, cty ct cty l> l< 


aftattttaa 




24420 


tgg ctcacg c 


G Lg LddLCL, L 


Ciy LLU L L 


Cf a dt t paarcf 


tcrcrcrt Qaatc 
u yyy u yy ^ 


aaaaacfCfaQcr 


24480 


/~f ^ fT/— f SI 4" 4" 4~ ff/T 

gaggactcgg 


i-ra rrrft - p a rrrra 
y clyy LL>dyy d 


a t" i~ 3 cr p c t* era 1 




daaatcccat 


ctctactaaa 


24540 


del L»ciOciciclci& 


l l ciy l. uy y y l* 


y *-yy ^-yy u y 


atcrcc tataa 


tcccagctac 


tegggacget 


24600 


gaggcaggag 


aaucagt uga 


acucyyyy dy 


g»-yyciyy uLy 


ps rr t* era nppa 
ciy u< y ciy ^ ^ ci 


acrat ccrtQCC 


24660 


caatt t catc 


Lcaaaaaaaci 


a aaa aaan1"t" 
dddddddy u l. 


ctL^yu u u. cty ^ 


1 1~ ppt pdrat t 

LLLLLLyCll, lw 


at t actcrcrac 


24720 


gtagtaatct 


ggcttcctgc 


aagu c uaac u 


4~ 4- r~* a a rra **"• 
CtCayCdydO 


uyycudLduy 


rrrr pi~ rrt" rrt" a P 
yy uy L.y LdL. 


24780 


t g tag at aag 


yCagUday to. 


ddy^ddddctu. 


^-y ct Lay ciy ^ cl 


fpaa rrrra f"aa 
i_ ^ ci ci y y . ci i~ ci 


ataaaaaatc 


24840 


cgcaaccaag 


r* a rr a a rf O f» 4- 4- 

Cdy day dL L L 


yaauciV' uuLa 


p 1 1 1 naataa 


rf crafaaaar 

uy m lcicicici^> 


aaataaacaa 


24900 


aaaaaat cag 


Laagga Ly La 


y d dy d L- u c y ct 


ct^_rCLCL^y i_- ci ci L- 


4~aar*aaar't"t" 

UCldVClCluV^ l_ u. 


aa pt taattt 


24960 


acacgucxag 


a s r*< r* 1 r* 4- rrran 

aaccc t.g cay 


aararflparh 
dd^dwd^ciw i— 


t" 1" t~ t pa a cjpa 

L W U CLCty V— • ct. 


t a p t raaaac 

uau u^c*y cio.^ 


atttatataa 


25020 


agtgaccata 


egguggacca 


Uddcty Udy L. L. 


fnaaraaafn 

U VoCtCtVvpCtCLCt L. 


l. l> ct ^ cty y ciy i— 


a aaataacaa 

ClCLCLCl C^t v-p 


25080 


accgtgtt t t 


ctgaccgt aa 


t~< +- -3 a r*TT" t" a a 
y uauay L- Lad. 


Lay ddd i_ l. 


ftaaaaraaaa 
yddddwdddct 


a apt aaa aaa 

ay w tuv a aaa 


25140 


accccaug ta 


4- 4— r-f r— f "i rs *a 4~ 4~ 

tc cgyaaa l l 


u Udd uauct^a 


P"f~ 1~ t" r?a aafa 
ull uy ctctct u ct 


a pa a a "hcrcra t* 

Ct^CtCtCtLyMCl L* 


raaaaattaa 

w ay ay ^ w mm 


25200 


ttcaaatagg 


aatttagaaa 


LaCCCtydat 


4™ rra a a 3 ai" a a 
LyaaadaLact 


hnana a t - a pi" 
L.y dy adLat u> 


a i— civ v w aan. 


25260 


actgtggggt 


gcagctgaac 


agcauauaya 


ripfa aaa it4** a 4- 
CyddcLciy Ldl 


ar't' ra i" a t" rr t* 
dLLLaLaL-y L 


rrpat"appt~t"a 

HLCllCtvL L LCI 


25320 


aggagegggg 


aggattgaaa 


gu taauggga 


fxcir* a a a a f~* a 
y y Cddddy t-d 


rrrri - rrrra hpap 
y y LyydLUdt 


f- 4-rra crcft* taa 
L vydyy l Lay 


25380 


gagttcaaga 


tcagcctggc 


raacagggcg 


aaar , r i /™'r'a+T/^ 
addLttLdLL 


fpfaphaaaa 
Lav Ladad 


dLdLaacicicia 


25440 


ttat ccaggc 


guagtgaggc 


+- (-Ta^*m^ w, a a rra 

u y dy yod dy a. 


naa t* prT+" t"(T(*r 
yddL^y l. L.yy 


a a p p p a rrrra ft 
aaL.LLdyydy 


rrp a c*ra acrt" ta 
y vdydyy \- Ly 


25500 


cagtgagccg 


cgattgcgcc


actgcacccc 


agcctgggag


acagagcgag


actccatctc 


25560


aagaaagaaa 


aaaaaaaaag 


aaaaggccag 


gcgcggtggc 


tcatgcctgt


aatcccagca 


25620 


ttttgggagg 


ccgaggtggg 


cggatcacga


ggtcaggaga 


tcgagactat 


cctggctagc 


25680 


acggtgaaac


cccgcctcta 


ctaaaaatac 


aaaaaaatta 


gccaggcgtg


gtggcgggtg 


25740 
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c c t* a t* a rt *rc c 
u> ^ uy ay luu 




craaaa c t cr acx 


gcaggagaafc 


gtcatgaacc 


cacxcracrcfcacr 


25800 


3 a c 1~ f~ a c a crt" 

d y ^ l_ L»y ^» ay *— 


cr a cr c c* a aa a t 


cgcgccacfcg 


tactccagcc 


tQcrcfcaacacr 


agagagactc 


25860 


t" crt* ci~ raflfla 

L- y v— ^ L-UCluQ. a 


cicicicicinuaau 


gttaatggga 


taaacatcca 


tctcaagaag 


ttagaaagga 


25920 


"i 4— ^ j^i ^ ^ ^ 4>* ^ 

atyaCaaata 


SS/'tf^aaaaaa 


CtCLCLCLCt-CLCL U-l^CL 




a t" ca t" aacrcrt 


caagactata 


25980 


aayayay cgy 


ctgggugcag 






a err* a t* t" t~ f~ aa 


cr a a cr c a cr a crcr 
y cici y v->»y »33 


26040 


tgggcagatc 


acccgagccc 


agg ay t tcda 


y ciuv-.a.y L.y 


a rr*h a 3 pa h an 
cty ucLcioci i_cty 


a rra rra cr* t" r*a 
cty ay ci^l luo. 


26100 


t ct tfcgctga 


aaataaaaafc 


■ *i ^ *^ ^ ^ *f— "i - ^ 

daaaaaLUay 


cty y <_ d l, y y 


uy y l. do i_- y cty 


g t-yyy^yy «. l - 


26160


CaCtUyagCC 


taggaggtug 


a rrrrr" t~ err 1 a crt - 
dyy l uy uciy u 


ci ciy ^uauyciL. 


t"crt _ cffT , ar , t'Ci 
wy tyu^»ui.y 


pa rt-'hr'acrcc 


26220 

W W £i U 


uyyy u y d u cty 


cty uyyy au.u*w 


L»y \* <— ucio 




a a crcrc t" crcra'c 

C4. CI ^ ^ 


crccfcrfc crcrc t c 
y^yy L, 33 vl *^' 


26280 


adatL Uy Lcici 




t* t* aaa a era c r* 
i— ^yyy^yy^ 1 * 


a a cr cr c t" cr a crcr 


t* ca crp acrttt 
v»> ^y » ^y 


gagaacagct 


26340 


tggccaacaa 


gacgaaaccx 




aaaaai~ar^aa 


a a a a t~ t~ a rrt - 1 - 
ctcicicLU Lay (_ u 


rrcrcr t" cr1~ crcr t* cr 


26400 


gcatgtgcct 


gtaatcccag 


CEaCtLayya 


/^t/~tt^ n n +~ n +* 

ggniuiiiL- L. 11 L- 


n ff a1"'hat'aH" 

Xig a. t- 1. a. t-d L. t. 




26460 


tacgtcgtta 


ttggactgaa 


u ucagaauga 


f «a 4- y-i 4- p -j 4- 

tydCtCUCdU 


L. gg ay c L. c u l. 


cc\~rtf~ct~cci~ 
Uy LLLULL. 






yCCuCCyaCC 


ULdv^ L. L.yy L. 


4- +- r 1 ^5 f 1 1~ +~ f* a 




cr r* t* r* a t~ 3 ccra 

y v»> l> v-» a i_> a, wy u. 


26580 


gtagatac lu 


*-i /~t 4— 4— yt /-i 4— 4— /—i 4— 


LLutV/CiUt uy 




L>oadL>wUUv^ 


**yy y 


26640 


cccctcctct 




i~i 4- /-*f*rt* ff a /"> a f» 
CtCyCyaCdC 


ULy Ly LLLLL 




aUL>dL^L'UUL 


26700 


tctcgaggcg 


gtcctttcct 


ttatccagct 


taaatacctt 


ctcctctgtt 


tatttggggg 


26760 


ttgggttttt 


atctctcacc 


ctccctctaa 


tttctttcct 


ctttccgcac 


ccatcaagcc 


26820 


tctcgtggtt 


tctcttcctc 


tactctcggg 


tcccccccct 


ctccccttct 


ttttttcttc 


26880 


acccccccaa 


gcgctttgcc 


ttttttttct 


ttgcccttta 


ttcccccc 




26928 



<210> 7 

<211> 29430 

<212> DNA 

<213> Homo sapiens 



<220>
<221> unsure
<222> (4336), (4345), (4349), (4392), (4447), (4490)
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<223> Identity of nucleotide sequences at the above locations are unknown. 



<400> 7 



aggggaaggg ccggctccgt


agctcacacc 


tataatccca 


gcactttccg 




60 


atcatctcag gccaggagtt 


caagaccagc 


ctgggcaaca 


cagcaagacc 


gcatctctac 


120 


aaaaacttct 


tttaaagctt 


aaaaaaaaaa 


aaaaaagcaa 


aaacfoacaqt 


tcaggagaaa 


180 


agcctgtaga 


ggcagcacac 


cr cr a aaa a 


av^y ^ oiy > tt 


aa c a c c acrcr a 


aaaactcrcrcc 


240 


atgggcactc 


actcctccag 


caaacaaQtcr 


cccagcacca 


gctggcccac 


ccagacaccc 


300 


aggacacggc 


ctgaatggct 


p r a f* a f~t c a c 


y >-yy y l -yy | - c * 


ataaacaagc 


aatacacata 


360 


gccaataagg acaccttagt 


a a 1~ cs 1" t" a p a t~ 
acx u»y u c ex ^ ci 




crcaaatcacra 


gaaatggtgc 


420


agggtgaagt 


gggttggggg 


y v_ i_. y \^ d l. y w i~ 


a p at" era era acr 

a v_> y dy ca. y 


t - crcrcr t* c acscscs 
u yyy L( -yyyy 


aactacatcrc 


480


tacctgagac 


agagcaggcc 


t" t" csc t" aaaa a 


aaa acr cr acre c 


acr c a era c c t cr 


aacaaaQcrtc 


540 


ctggggtggg 


agcacactgg 


pcff^cra cr1~ crt~ 


crcrcrcrcrtacrca 
y y y y y ^y ^ ci 


taacaaatac 


tggtcctctg 


600 


ggcgccttcc caccacgtca 


t* crt rrr p pa t" cr 


fc crc c c a aacf fc 


ctctcatttc 


acagccccct 


660 


gaagctcagg 


ggtcacagct 


d cto ciy ^ l» «w 


pa era i~ a rrt" t~ 


acrrrtacccc 

yw^> i—' y *w w ^ »^ 


aggtcattcc


720 


atccagtgat 


ggacctgctg 




+- ri a |-t +- f-« 4- /t/-t 

uy cn_\_ ll i»yy 


y cty y l. ci ct l- 


t~ t~ era aaaaoa 


780 


ggagaaggga 


gggcaacaga 


L-ouyyyy oyci 


t" era crcrcra t" err* 
y ciy y y ci. l» y 


ap aaaataac 


agacacctga 


840 


ggctgcacct 


tggagcctca 


y Luccyyy L.y 


tggguggggy 




Y era crcrcrp t" era 
i-y c-y yy *-y a 


900 


agcagctggg cccggccacc 


dtCaCaeCCC 


dyyauCCaLC 


artaf pappaf" 
ayaULdL'UaL 


rra aaaar pcra 


960 


atgtcaactg 


gcagcccaga 


gtgcagaaca


aacctttcag


aaacacggug 


y LgdLtycL.y 




catcatgaac 


ataaaataat 


cacgccctcc


ccccayyyat, 


taL-ttt uy Oct 


rrrra nt" t~ t" at" P 
y y cty l. uy 


1080 


ccaagaaaca 


ccagaaagaa 


ggaaaacgtc


tgagtcacaa 


^ «-» ^ 4-* i*-t t— ^» 
LatutyCtga 


r<nf+ "H a +* t" f" 
yy ol. L. LaL ll 


1140 


gtaatagcaa 


aaaaaaaaaa 


aaaaaaagaa 


caatctccag 


cggcaggggt


aactagacta 


1200 


ttgtctccgt 


ggaaaggtag 


caccaattaa 


ctagtaacaa 


aatgactgcg 


gtaacaacaa 


1260 


aacgttcgac 


atgtcaacac 


caaaaaccac 


acacccagca 


taaccgtgaa 


ccatgatttc 


1320 


tactagaatg 


aatggcagtt 


atgagaaagc 


accageggag 


acaaagattg 


aaaaagtaaa 


1380 


ggtggcctca 


ttagggagac 


aagtctctgg 


gtaatatatt 


gtaatactgg 


taaatatata 


1440 


gtttttaata 


tattttttaa 


ttccaaattc 


catatatgtt 


cctatgaagc 


tatttctgea 


1500 


aatatttttt 


tcaggaccgt 


acatcacaaa 


ggcaaaaggg 


ccaggtcagc 


tctccagctg 


1560 
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agagtgacca . cttcagagca gacggcagac tccagggtta gcaagcctgg ctgagacctg 162 0 

gcccatgaca atcactcaac ccctctgacc tcaacatcct gtctgtgaaa tggggataat 1680 

tactgcacct ccacatcaca gagtgcgagg cttaaacagg atgcttcata gaaaagcgct 1740 

caagaggtaa cagccgggag ggggtagtgg ttttcattaa ttaaatgttg ccttcatcca 1800 

gccctgggcc agctccaaca caaagcacac accatccact cagactcagt tgcctggatt 1860 

caaagcccgg cctggcctcc agctgtgaga ttccgggcag gatttcccat ctcccagagc 1920

ctcagtttcc tcattcatga aacaggaagt gatcattcct tttattttta tttttatttt 1980

tattttgaga cggagtttca ctctagttgc ccaggctgga gtatgatggc gcaatctcag 2040

ctcactgcaa cctcggcctc ccagtttcaa gcgattctcc cacctcagtc tcctgagtag 2100 

ctgggattac aggcacacgc caccacgccc agctaatttt gtatttttag tagagacggg 2160 

gttttgccat gttggtcagg ctggtctcga actcctgacc tcaggtgatc cgcccgcctt 2220

ggcatcccaa agtgctggga ttacaggtgt gagccaccaa gcccagttga caactgcttt 2280 

taaagacacc tctggctgct gtggaaaaca gcctggtagt gcctcaaaaa gttacacata 2340 

gaatgatcct atgaccagta attccactcc tacatatata cccaaaagaa ctgaacccct 2400 

ctactcatgt atgtacacat acaggtacac gcatgttaac agcagtgttc acaaagccaa 2460 

aacatggaaa cagctcaaat gtccataacc gatgaacgga taaatgaaac gtagtctatt 2520

caccacctga cggaggtgag aggggccata aaaaggaatg atgcataaaa acgaatatta 2580

tggccaggta tggtggctca cgcctgtaat cccaggactt tgggaggctg aggcgggcgg 2640

atcacgaggt aaggagttcg agaccagcct ggccaacacg gtgaaacccc atctctacta 2700 

aaaatacaca aattagctgg gcatggtgga gggcgcctgt aataccagct actccggagg 2760 

ctgaggcaag agaatccctt gaacctggga aacagaggtt gcagtgagct gagattgcac 2820 

cactgcactc cagcctgggc gacagaccaa aactccgttt cggaaaaaaa agaaaaaatt 2880 

agccaggtgt ggtggcgggt gggtccctgt aatcccagct ctacttggga tactgaggca 2940

ggagaaccac ttgaacccgg gaggtggagg tagcggtgag ctgagattgt gccactgcgc 3000

tccagcctgt gtgacagaag gagactctgt ctctaaaaaa caaaaacaaa aaaggcccga 3060

cgcggtgtct tacacctgta atgccaacac tttgggaagc caaggcaggc agatcatctg 3120

aggtcaggag tttgagagca gcctgggcaa cacggtgaaa ccccatctct actaaaaata 3180 

cagaaattag ccaggtgtgg tggcacatgc ctgtaatccc agctactcgg gaggctgagg 3240 

caggagaatc gcttgaaccc aggaagcgga ggttgcagtg agccgacatt gcaccattat 3300
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actccagcct gggtgacaga gtgagattct gtctcaaaaa 


aaaaaaaaaa aaaaaaaaaa 3360

ctaaacaaaa gcaaaaaaac caatgagtaa tgttgtcaag tgaacttcat cccaatggga 3420

atgcagataa tttgtttaaa aggcaccatg cacactgggc aggctggctt cccctgggaa 3480

cgtcttcttt tgcctggatt cccagttggt ttaatcgggc gtagaacact ttcttcaatc 3540

cgggattcag gcacccctgc tcagcacaaa ctcagtacac cccgcactct gctgtgggtt 3600

cttggcacta ttaggagaat gtgagggggt gattcagatc tatctctagt gggtgcatgt 3660

ctgccactcc caggaacgcc cacttctggc aagtcagtgt cagagaaagg ccagctcgtg 3720

gcccctcctg ccttgagtcc caggacccgt gatcagtcct acccggagca gaatcaggag 3780

tttgaaaacc caagtgccaa caatctcatt ttaacccatg taagcatatc caatatttat 3840

atatagaatt cataacagat gtctgggctt ccattccaat agcctatatt ttacactgtt 3900

tatttacatg gttacaccaa acaagactca attcaaggta acccaatcct ttgctactat 3960

accaaaataa gcaacatttt cagtccatgc cttatatata ttcaccaagc attacactag 4020

gcctccaact gctcatcgga gcaagctgca gcctggacac aagctagaga ttaatcagtc 4080

aggaatgatc ctgcgtccag tgccagcatg atggaagaga cagagaaaca gaagacatca 4140

gggctccaga gtcaaggagc ctgcaggtta gttgggcagg atatacacac atacacacac 4200

acacgcacac acaaaaccac ccaagaagaa aaggtgggat gaatgcatgg acaggtaatg 4260

cctggagcct ggggatggat aagctgactg caggtggccc aggcaggctt cctggaggaa 4320

gaagacctgg ctgtangtgg ggtangcang ctttctaaat ggggaaaatc tggctgtggg 4380

tggagttggc angtttccga aaagaagaaa agctgactat gggtacacct ggctgttggt 4440

ggaacangca ggcttcttgg aagaagaaaa tctggctgtg ggtggatcan gcaagcttct 4500

tggaagaagt aaacctgact atgggtggac caggcaggct tcctagagga agaagaccgg 4560

ctgtgggtga accaggcagg cttcctagac agaggaagat ctggctgcgg ttagagtggg 4620

caggcttcta agaagaggaa gggctgactg tgggtagacc tggctgtggg tagactgggc 4680

aggcttcctg gaggaggaag agctggagca ttgaaaaaca aacatgactt ggtgaatgtt 4740

gagcatgccc aggcctgatc cccagaggca attacgcact caagttactt aattctactc 4800

acaatgcctc acaaacaact tctctgacac ctaacacagc tctgggcacc ttctagcttc 4860

agctcctcaa agcagttatt cacgctacta ccctgcacac ctcctcacac cccaacccca 4920

gggacaggag ttctgccaga tgccaaagct cctgatgcca aagcctgggt ctgcttccgg 4980

gctcctcttg gtctaactgt ccaccccgca tcggcatgat gtgcaaaaac aaggctttgc 5040
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aatctgccct gatgcctggc ggagcgagtc cctcccgatt cgtctccttc agaaacacct 


5100 


gggctgccct ggtcctgtta tacccccaac acattctaca gtcagctccg caagttccac 5160

aaagatcaac gctggcgttt ttatggcatt ttatttacag tttttacaat ataaaaaagg 5220

aaggatgcca cagctcagcc agcaggacag acagagatct atgatgcttc tgctgcacca 5280

ttgtttgtgg tcaagaaagt ctgttttcaa tgatttatta aattgtggtg ggagatggat 5340


ggtggcagtg gttaccagca 
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acatgaatgt tcttaatgcc 


actgaacttc acacttacaa 5400

atggttacga cgataagtgt tatatgtatt ttaccacaat taaaaacagg taaatgcagg 5460

ccgggcacgg tggctcacga ctgtaatctc agcactttgg gaggccaagg caggcagatc 5520

acctgaggtc aggggttcga gaccagtctc gccaacacgg tgaaactctg tctctattaa 5580

aaatacaaaa attagccaga tgtggtggtg catgcctgta atcccagctt ctcaggaggc 5640

tgaggcagga aaatagcttg aaaccgggag gcagaggttg ccatgagctg agattgtacc 5700

attgcactcc agcctgggtg acaaaagcaa aactctgtct caaaaaaata aaataaaata 5760

aaaataggta aatgcaaaca tatggtatag taatattatg ggctattatg agctacaaaa 5820

aagaatgact tgggactaca gttacagccc tcattcagga atttgtttta aatgtgggtt 5880

ggtcgctaag gcatgtacac aacattttga cgttcaaata ttcctagatt tggacagtga 5940

gcacccctct aagctggctc ttctgtccca gaggtcccca ccagtcctcc agaacttctt 6000

tgctttctta cacaataaga tgccccatgc tcggcttgta cctttccttg ccccagccct 6060

agaaccagct tcttcgtgga caagctctga ctcctttggg tggagaatgg tattcagaaa 6120

cccagacctg ggctctggtg tgctcactgc tacttggggt cattgcttct aggcctctct 6180

gctgatggag gtaggatata cacgtacagt cttccctctt cccagattcc gtacttgagc 6240

tcgcctactt gctaacattt atttatatcc cccaaattaa acctcacagc acttctgcaa 6300

tcactcactg acttgcagag tgtgaaaaaa ctgagtcacc atcacacgtt ccaaactgag 6360

gtcaactgag gccacaacgc cccatcttct tgctccggct gtcgagatgt aagcaagtgt 6420

ccttctctcg gtctagctag tgccatgctt tccacatcac tgtgcttttt gtgggcaatt 6480

ttgctgtata aaatgtcccc tgcacatatg ctgctgtgta gtgctcctag gtgcatgagg 6540

ctgccccacg ccttacagag agaatatgca tgagaggctt tattcaggta tgagttatag 6600

cgtagttggc catgaattca atgttaatga atcaacaata tacagtaaat aaggtgcttt 6660

ttagagacag ggtctcactc tgtcacccag gctttagagt ccagtggtgt gaccttggct 6720

cactgccgcc tcaacctcct gggctcaagt gatcctccca cctcagcctc ccaaactgtt 6780
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gggattacag gcgtgagcta ctgcactcag cctaaataag gtgtcttaga aacacacata 6840 

agacaaggtt atgggctgag tgcggtggct catgcctgta atcccaacac tttgggaggc 6900 

caaggtggga ggttcacttg aggccagaag tttgagagta gcctgggcaa catggcaaga 6960

cctcatctgt atattttttt aaatcagaca ggtgtggtgg tgcatgccta tagtcccagc 7020 

tactggagag gctgaggcag gaaaatggcc tgagcccagg aggtcaaggc tgcagtgacc 7080 

catgattgta ccactgcatt ccagcctggg gtgacacagc aagacgctgt cttaaaaaaa 7140 

aaaaaaaaaa aagccaggtc aggtatcgaa cagttggcaa aaacgttgtg acctgaggct 7200 

cacaggaacc tagcccgatg tttcccctag gagcaatggt tcagtattca ataattcagg 7260

gttcccagtg actttatgga gcataacttt caagaataac aagaaccaac tgtacgtgtg 7320 

tatgtatact cacactttta ttttatttta ttttattttt tgagacagag tctcactctg 7380 

tcacccaggc tggagtaaaa tggcgtgatc tcgactcact gcaacctccg cctcccaggt 7440 

tcaagtgatt ctcagcctcc caagtagctg ggattacagg tgtgccccca caaccggcta 7500

atttctgtat ttttagtaga gacggagttt cgccacattg gccacgctgg tctcaaactc 7560

ctaacctcaa gtgatccacc cacctcagcc tcccaaagtg ctggaattac aggcatgagc 7620

tgccgtgcct agcctacata cacttttata cacacatgca tctatgacta tttctctatt 7680 

tctgtgcatg tgtgcgtggc agtacctaca gtttcagcta tgtgtctggg tactgtctcg 7740 

tccaagtttg taagcacctt ctccaaagtg caaagcctgg cttgtgttac tatccatatg 7800 

tttacttatt tgctcaatca atttacttat tagctccata accagcttcc catctgctcc 7860 

agtagcctct gctgtcagtc acctctgcac cctaccccac cttgcttccg gatgctggat 7920 

gccaatcacc cccgacacct ctacatagca ccaccctcga catgctgctt ctttatttct 7980 

tatttatttg tttgagatgg agtcttactc tgttgcccag gctggagtgc agtggcacga 8040

tccaggctca ctgcaacgtc cgcctcctgg gttcaagtga ttctcctgcc tcagcttctc 8100 

aaatagctgg gattacaggt gcccaccacc acgcccagct aatttttgta tttttagtag 8160 

agatggggtt tcaccatgtt ggccaggctg gtctcgaact cctgacctca agtgatccac 8220 

cttggcctct caaagtgctg ggattacagg tgtgagccac cgcgcctggt ctgcttcttt 8280 

aaatgccagg caccaacatt tgtgcaatgg ggtgggagga aagaacaggg aggagagcac 8340

actgccggcc cctgcactga atccactgat caatctgggg gcaactgcca tctccatctc 8400 

ctgtcttcct atccgtgaac atctactgca gtcctctcca atgtccttct gtaaagttgt 8460 

attatgtttt gcatacaggc cttgcatatt agttctcaga tataatccat atactttata 8520 
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taaaattcaa accacattta aaaaaataaa actagcatga ctataacgga gtctgcaaca 8580 

ttctcacaga ctttatgata aaacatgaaa cttcaaagat acttagggtg gggcagggac 8640 

aatgtttaag gctgcctgga agcctcccca tccctgagcc agaaagtcct atctcccctt 8700 

caaggggaaa tgcttgaaaa agcactgatc aggctaaaat gacagggatc agggagtaat 8760 

caaagtacaa gtgagctggt ctcctccatt ctgagcacag caaagttcag tctctccaag 8820 

tccaagaatc atacacctgt ttgccaagaa tgaagttcag gtgtctacaa gtggctgaaa 8880 

atattcattg ctgggccatt aacaacattc ttggcaaaac cataccttag cttctcgtgg 8940 

aaatttctta aggtagaaga aacaggaaac acccaggctc gcttttatgt agacagttcc 9000 

atgaagccag ggaccttccc cacatccacg tttcaattac ctgcacgcag ctcacagtgt 9060 

attcaacatc tacgcgtctc tcctactggg gtggcggtgg ccactcaaac cctcatgcag 9120

ctacgatgac cgcaattttg gcaacataat ttcatgtttt tccttgggct tttacccaag 9180 

tcagtgacac aattctgcag ttgtctaaag attcaaaatg agggacttga catttacaac 9240 

aataataaaa tcttgggttt cctttaacca agcacatgtt ctgcctttta gagaaagctc 9300 

tgcaaactca agctggagtg ggatacttgc tgacatcttc aagcacccca ggaatagctc 9360 

tactccccca tttccacctt ggctgaacca tctatatccc accaattccc ccaacatccc 9420

tccatccgtc catccatcca cccaaggacc tgctaagcca ggaggtctct cccatctacc 9480 

ccacagcctg gcctcagccc acaagggctc tctctacatg aatcccaccg caccagagta 9540

gaccaagtct cccgtagact ccaccctgac cacctccatg cctccagcca ttcccacccc 9600 

taaaaaccct ccctggtctc tacacccagc tgatgaatac ttggctgaat gtgacctggc 9660

ctcctggacc caggtgaagc ccacgtcctc cgtaagcccg ccagctcacc ctgcctctgc 9720

accttcactg gagagagccc gcacttcacc tcctcagggc aggcatggct gatgccaccc 9780

agtggaatct ggtgcaaagc agggcccggt gcagagcagg gctgcctgca gagcaaggcc 9840 

ctggtgctgg ggccgagcac ctccaatgct ggccgtggaa ccatccctcc cattccaggt 9900 

gctgtctcca tcaagaatga gcgagctgct gacatttgca tgacaataat gaataaatac 9960 

catattttgc ttcaaatcca gaatagatgt ggccagggtt ggcatatgac tgttgggaaa 10020

ggacagtttg cctcttccca aaccaacttg gattataaaa agcttttctt aacgaccaca 10080 

agagcggagg agctcagggg cagacaaaag gaaggctggc tgcagaaggc gggagagtgg 10140 

ggccttcagg ggcgggtggg gagagagaaa gcctggagct gcacccccaa ggtctgtgta 10200 

catcaggtgc tacagaataa caccacctct tccagcttgg cccccacctg ccctctccca 10260 
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gcccagtcac ccagacagca ccccactccc cacacacacc tcacatctgc ccgcctcaca 10320 

ctcaccagct tcggctctca atgcaacctg gaacctgccc ttggcctctc agctcagcca 10380 

cccccattcc tgttggcccc tggcccccca tcgaattctc tctaatccta atgcacacac 10440 

ttgcacactc aaacacacac acacacacac acacacacag cccagaggaa aaccataatt 10500

gactgaggtc caggcaagtt tcccgagcag ggaccacatt tcaaaggtca gggaagcagg 10560 

cgaacaggaa acatacaggg ggcacgtttg ggggtggagc aggaaataag aaatcacttg 10620 

caaaagataa aaagaaaatg aggtagctgg tttcagacac ctcggagcac acagaacagg 10680 

acaggcgcct ccgggtcttc cctcaacagg gagatgggcc aggcaggtcc ctgctgctcc 10740

accgcagagc tgggggctat ggccctgaca ccaaggccct ggggcaggcg gggaggcagc 10800

tgttctcctg cctgtgctcc cgggcagggc ctggccccac aagggaactg gccgaaggct 10860 

ctgcttggct actccggaaa gtcctgggag acaagcaaag gacttgctag gtcactccaa 10920

acggcccaga tgtgacaact gtgaagaagc cacaccaaag caaggtgaca gaacaatgtt 10980 

ggtgacgtca ggttatcagc ttacgctcaa ctccacttac ccggactcac ccgtaacctg 11040 

ccgtctcttc ccaaccagta aaggatgcct aggtagaggg gcacaaggcc tggagcataa 11100 

ttaccatttt aaaggctctg agaagtcctg cggtgaggaa gcctagttca ctttctctcc 11160 

cctaggattt cccaactgcg cctgatcaca gaacattttt tcatttccac tcaggaaaca 11220 

tattttgaaa aacactggcc tagaggcaga agtgaaatgg aaaacacaaa agtaaaactg 11280

aacaggaggc actgggcaga gaacggtcag aggcgccctg aatcctggac cggtggagat 11340 

ccccagcttg gcatgctccc ctccctgggc ccagaccgcc tccccccatt tcctggataa 11400 

gaaggctaat gcgcatcagg gtgaagggct tgcctgggct acacccccag gctcgcccca 11460 

caccaatcgc gctcctgcga gagccagtga ctttcttgat ttggctactg tggaattgtt 11520 

tgcaactaac caccccagat acagatacaa atgacaggat gatcagatgt aaaggaccca 11580 

caggtctctg tgatacggct tcatgcagcc agcatggcta gtgccgtgca gaatgagaat 11640 

gaccccaggc aagtccttgc ctcccagacc cagaacccca tggagcccac cagggctggt 11700 

tcacaagcac tgtctgggtc gggcagagat tccagcaaga ggagggaaca tccatgcacc 11760 

ggagccagtt accagaagca aatcgcctct tccaaaaccc aggctattaa tggagtccac 11820

tgttgagtgg agctggggtc tagctatgga atactgcaca gcagagatct tcctgagaga 11880 

aagcagtttt ccctgaaagc catgtgtcct ccactaactg tgttttaatt gggcgaacgt 11940 

ctgtatctca ttgcagtggc cgcgcatgtg ctgacaaggg gctgggggcg gggtggggag 12000
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cagaagctca 


ggggcctggg agggaaggaa acaggccacc agggctcccc agaaggcatg 12060


L- d U- L. <w l, ^ 


acaaacacac gcatgcacac acacgtgcac acatactctg caagccctga 12120


gttagcaact 


gt.gyaat.yuy ci.v-.uciy*- u^ay 


tgatcccagg acaagctgct agggaatatg 12180


-^i +- J— 4— w* -«| 4— +-* 


yat.yt.CL.yucL ctctuy L.y wy l.u 


ttcactaatt agaaggttta gggcagagca 12240


y d.y CLCLCLCLCL 1— Ct 


tgtatttcag agtcccagtt tgacctgcca gaaaccagcc cattactaac 12300


attrttat tt 


tcaacaaaat atagcattct gattacatac catcttggtt ccacgcctcc 12360


l>^h »-» v ^ y 


agcccccgga agcggcccaa ggccatggca aatagtgaga gaaacagttc 12420


cayyy Lyyay 


actgactcag gggtgtcagt cagtggggcg ctgatggccg gtgggaggcc 12480


cty u cty l. u ci l. u 


accctctcct tgggacagtt gagtagctct cccccagggt catgtggcca 12540


CtCaygt tCa 


tatgggaggc gagaggagtg gcagagtcca ggagagtggc tccgaagtca 12600


^ i, y t. u u o u \_ 


caggcctcag tgtcttcatc cattaaatgg gtaggctgag gtctgggatg 12660


acaagy ayyy 


cttgcactta ctgaaaccca tgggaggctg ttcgccgatt tcttttattg 12720


ciuyy clciy clclcl 


acactcgtat aattcaagta ccaattaaaa ggcaggcact ggaaccaccg 12780

tctgccaatt cctagttttg cctataccaa atttgagcaa gttaattgac ctctcccagc 12840

ctcagtttct tcgtctgtaa aatgagggta gggatggccc ccagcccaca gggcagctgg 12900

aaggattaaa gaaatcaaac atctcttaga gcccacctgg cacactgtga tacacaacaa 12960

atgttagcta tttttgtcta tgaagtctag attttatatc ttgggtgttc taaagcagga 13020

tacatttatt taaaaacaag gattttcatt aaacacgtac cccacagaca gcaaccccat 13080

ggagactgct cttaattcag gccagtatcg aaacgactct aactacaagc tttatacagg 13140

tctcttggct gtccttcaaa tccaactaag gtggtacttc tgaagcactg tgcacatgtg 13200

tgtgtgcatg cacacgtgtg ggaagggcgg gctcacggat ccctcaggta ccccacccac 13260

gcagtctcaa gtcacaaagc gacagagcag ccgaggaagg tctgtgcccc actggaccct 13320

cgtgaagcca ccaactctac ctctgcgccg tgtcctgcag actgggctac cctttgggtg 13380

gggaccagca tttgatgcaa gaaaggcaga cagaaaagga aaagggcaag ttcgactcca 13440

gataacacag acagtaccaa gccccagggt ccataaatgc cacgcagatg gaagcattta 13500

ctgcgaggcc acacagcaaa cgcacggatc cagggacgga ggtgcagact gcggtgcccc 13560

tgagccatga ccctgcaaat taccaccatg ggaaaggagg ctgccaaacc ccccgacagt 13620

cggctgggct ggcacagact cgtggtttcc atcgaggtgg gaggaggtgg gacgtcccag 13680

cccctccccc atgcccactg cagagggaag cggccgtttc ccctgtgtgg ttacaaaggt 13740
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ctcattgttc ttcctcacag ggaggaaact ggaggaccga gctcagaacg cattttagaa 



13800 


ctggcagaaa agaacatctg gggaaggaaa cacatttcag aaacaaacat acctttgtac 13860

cagcttttat tttctttaag tgttgaaaaa ataataataa taaagacatg ccaaatttat 13920

catcgctcta caaaatccct ttattgagca aaacgtggca gctctacttt caaatgatta 13980

ctgttcctgg aaaattgcag caacgtggat gccaaggccc gaaggccgcc atcagcagcc 14040

aaacaaaaga tgccacctcg ggctccgcga cactgtacca tgccagggaa ctggacagat 14100

ttggggaatg ccacggtttg cctttaaccc cttgcctcct ggtctcctga tgcatctcag 14160

aggctaacat tctttgagga actggcattt cttagttgta aatatgcatg tgggtttggg 14220

agctgcctgc aaagtccagt gttgacgatc agctttgatt tccttggaat caagtttacg 14280

tgtcgagtct ggaagttaag aagaatttgg agaagctgag cactatggtg ttgcaggccc 14340

tgggtgaact cttccaccaa gcattcattg tggactgaca gcgtgcgagg ggctctgcag 14400

gcaggtgcac aggacgaaac acattccgtc cgggggaaac ctgcaggaaa gctccctctt 14460

cttcctaagg tgccgggcct agcttcatgg gtccctaccc tccacgcctg tcacactttc 14520

tgagtctcat gtgggagctg cttctggttc ctgacttcac tcagtcctca taggaggtgg 14580

aactactgtc accccatttt acagatgggg agactgggca caaggggacc aagaaaccaa 14640

tgcaaagtca cacttgtggg atcagtgaca ggggagatca attcccaggt tctttctgca 14700

agagttaaat tgttttcatg ctgcctaagg gggggcaact gaaagaccac tgcatatctt 14760

tgccaaaagg gtcaagcaca ggagccgcag ccagtgggtc agatccgcag aggcgctggg 14820

gtgaccctcc ccatacctgg agggatgctt gtcccctcct ggccttcact gggtcccctc 14880

atgaccgtgg cctcccagga cctcagcaca atcccggtcc tgtgctccag gacaagccct 14940

ccgtccccaa gactgtgagg aaatggaacg aagaggggct cgctgcagcc cagcacccac 15000

actgcccctt ctcaggggca agaaccgtcc tggaggactt ggctttggag ggggagcctg 15060

ggaggccagt aagtcaacaa gcctctactg ctcatgggtg ggatcccacc gcaggccccc 15120

acctgctggg gcgggcaggg acgggcggca cagcttggcc agggcagata acccccacct 15180

tggccagggc gaaggcagga cacgtgggct ccagcctggc cccaccatcc ctgcacaaca 15240

ctgggcaaag tccacgtttt cctcaactgg gtgttgacat ctgcaggaca ggggcatgga 15300

ggtacagagc gctgaagcca cacagcaacc taggagcgag actccatgcc tccccgggga 15360

cccctcccca ccatgaggac catgaaggct tcccatgtgc cgcaaggact ctggtgtgga 15420

gacacacgtc tcctacacag ccaggcctaa cgctcttgta actgggtggt cccacctggg 15480
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ctcacagctg gagggccagg agctcaaggc 


ttcgcagggt ctgctctcat cccagaggcg 15540

atggggagcc acagcaggct gcaggagaga gggtgggccc cctccacttc agaggcccca 15600

tctggcccac agactggaga gcacatctct cagcaaccac ggagcgccaa ctgcgcacag 15660

ggcctggtcg tcagagcggg gcaaaggcac tgaccgtcac ggccagggcg agggaagacg 15720

ggtgggcagg gaccttgggc agagggggaa gaacctggtg cccaggctgg ccctgccttc 15780

agcagtgaag ctgagtgggg aggcgctgat gcagggggcc agaaagggct gctggtcagc 15840


CQQQacrcracrc 


cccccacaga ggaagcagcc agcccagacg cagatggcag ggtcccctca 15900

acaatgtcct ctgaaaagga gaggcgggga ctgctctggt gacacctaca aatagatagt 15960

cagccctcag ccccctgcca tacttctgac aaagcagagg cccccagggg aggcgcaccc 16020

gaaggtacct gcacctgtcc cccagactcc tagagcccac ctgaccccat cccaccaggg 16080

ctccagctac aaaataaatg ccgaggccag ctaggcaagg acgcacactc ggtaccgact 16140

gaataggctc cacgttgtca tgagcgcaac ccacaggcca ccaggccaca ctatgcagag 16200

ctgagatggt ttcggccaag cagcctctca gctgagctga acaagtccag agtccccggg 16260

gggtcgtcac tatggagtaa caattgcgat gcgatggtaa ccctaacagc taaccgtcac 16320

tgagccaggc cctgagctag gtacttttca acgctgcctc tctgcagcct caggacgagc 16380

ctgtgggagc ataaagatca ttccctatca cggatgggga aactgagctc tgaagcagtt 16440

aacgtgcttg tcccagaccg cagagctagg agcaggacac aacagcaggt caggcaggaa 16500

cgggtgaggg gggcctgcat gggcttctct ggaggctgcg catacacgca acccccagga 16560

ccccgaccct gcacctgcag ctcgctactg ccccctcagt gactccagca aacctcgggg 16620

taggggaagg aggctgggaa tacctcgggt gtccgaaaca gcagcttctg cttggaggcc 16680

actgctgcat aatggttgct gcccagcaca ccccaagcca cctgtgccac ctgtggtgac 16740

cttccagcat gccttggtga ccaagctggc cttaggtgct gtgggcagcc aagaatagaa 16800

cagggcccac ccctcctctt cacactaaca caaagcaaga ggcgggcact tcgactgagt 16860

gcatccctct agctcaaggg cctcacggat cacaggggtc agggcaagat cccaattctg 16920

cattcccgtc tgcctttcat cctgctctgc caacaacagc cagtgaggct ggggacatcc 16980

ctgaacctgt ttctcacctg aaacacatca taccattgga ccccagccct ccgggagagg 17040

ccctaatccc tgactgtggt gagatcagat cactggttaa gtacccagaa gggccttggt 17100

caggggctcc aggggtgggg ggtgatgggc gtggtggtat cccgctctgg gctatagtcc 17160

accctgatgg aggaggtctg tggtcagaac cgggctgtgc agggcacagg agcccagagg 17220
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gacccccaga 

gctcacctgg tggtctctga gcagggctcc ctcaaccctc agagaaaagc 17280

acagcaagga ggccgcccag agcccagcgc ctagcaccca gtggcgtgcc agacctgcct 17340

ggatcctgga gatctctcat caccctccaa gtcagtcatg cccaacccag ggacccacag 17400

cccacggggc cgtgaaggtg tgctgagtcc aagaaggcct tcgacactgg gaagccaagt 17460

ggcacctcct ggtgtggagc aggcggaatc ccaccagcct ctgctctgcc agtgggcaca 17520

gctggacgat gagcagaagg ggctgttgct taataaacgt catttcctta agaggataaa 17580

acctttcaaa acagatggaa attttttttt aattaaaact ggtggccaaa gagatggaaa 17640

gcaccccttg tgcctccctc ccatcgtgac ccatcctctg cacacctcaa gctgttcgct 17700

gcccaggtgt ctcctgaggc actgggggcg ggtgagaatc cgtgagccct cggccagccg 17760

tggctctctg gagctctgcc ccaggccatc agggcacacg ccgggcaccc tgggggccac 17820

acagggcaga gcccagctgg gtcagcacac agggccacac tgggcacaca agtctctgag 17880

cctcccctgt ggacgcagct ctcactatcc caccccacta ggtcccgggg atctgtccca 17940

cagggtgata tgctgtcaca gaccactacc agagccatgg cctgctgttc cgcccgcagc 18000

caggtagtca cttgctccac agggacaggc aacgccgcac ttgggggctg ctctgcggca 18060

ggactagagc tccagcagct cagccctcct gagaaggaga actccatgct ctaagaggca 18120

gacgcagcgg acggcaccaa agccaccaca agcccacggg gccctgcatg gcaggtcagg 18180

agtccctgac cactcgctct ttgtaaccag agctgcagtg gagtctacga ggcaaggact 18240

gtgggcggca gtggccacag caaatgaatg agtgtcccaa gggagcaggc ggctgcgggg 18300

aggcacagcc gggacccagg agtcctccgg cactgcagca aactccctgg gccccctgag 18360

cagcgaccag gtggcaagtg catgaactcc cgggggcata acctgggagg gtgacactct 18420

cttcgtgttc aaattcttga gaacgcatta aaaatatcac tcagtcacct actctatagt 18480

tttaactcaa aagtaccaaa gtagccaggc gcggtggctc acgcctataa tcccagtact 18540

ttgggaagct gaggcaagag gatcacttaa gcccaggagt tccaaatgaa cctgggcaac 18600

atggagggac cccatttcta caaaaaaagt gttttaaaaa attacctggg cctggtggtg 18660

tgtgcctgta gtcccagcta ctcaggaggc tgaggcggga gaaccacatg aacccagggg 18720

aggtagaggc tgcagtaggc tgtgatggca ccactgcact ccagcctggg taacagagtc 18780

agactctatc tcaaaataaa tttaaaaagc accaagccag gcttggtggc tcacacctgt 18840

aatcccagca ctcagggagg ctgaggcaag tggatcacct gagtcagaag ttcgagacca 18900

gcccagccaa catggtgaaa ctccatctcc actaaaaata caaaaattac ccaggcgtgg 18960
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tggcgggtgc ctgtaatccc agctactcag gaagctgagg caggagaact gcttgaaccc 19020 

aggaggcaga ggttgcagtg agccaagact gtgctactgc actcaagcct gggagacaga 19080 

acgagactcc atctcaaaaa ataaataaat caatcaaaac caccaagact ttttaatata 19140 

aacatttatt attccataat tccttttttg catgattaaa aatgtttata taaagtttcc 19200 

tgaaaatggt aagaatgcca agtgaaggct gcaaatgccc aagcccccac cgtggcatct 19260 

cacggagtct gggccctagg aggctggtgg gtaccacgtg gacccgagac ttcacagtca 19320 

agtccctttg gggtacactg ggtttcccac accccagaaa tatgggctct tactgcagga 19380

ccatgggggt cctcacactt ggcccagaag ctgtcacata gccagacagg tgttctacaa 19440 

cctaggctag agggagctca tgctccagca gaattcgagc cagaggaggt aaaagatggg 19500 

taagatctgc tccctggaca gatgaggcct tggcctcaga acagttactg atcatctacc 19560 

agacatcaca ctagaggcag aggggcgcag acgaagacag cccctgtcct caaggccctc 19620 

ccaggttggg tggaccatgg aaggttccag acagatctgg caagagaagt gcccacacca 19680 

ggggcagaag atgggcaggt ctgctcaggg cggcacggcc tgccaggcca aaaagttcca 19740 

acttcagatg ctggagaatg ggcacgactg tctgagaaag ggaaggatgt gatgaaaact 19800 

acttggagaa aaattaatct ggccagagca taagataaat gggcaaaggg gaggttccag 19860 

aaagcaagga gaccaagtaa aagctgatgt cattggctct gaatctaggc tttcactgaa 19920

tatgcaccgc agggcctgta ggtaaagcct cagagcccag ggagtctgag tggaggagag 19980 

ggcaggggac agagctgggg cctgtgtcta cagtgctcag gaggaatagg catggacgtc 20040 

agctcggagg ctccagctga agtgaggagg cggccagggc agcacggcca cgcccggatc 20100

cagactcctt ttgggaagca agttcgctct gggggaaagt ttggagaaat ggcctttacc 20160 

cgcagaagca agccccagaa catatcttgc tccaaaacta tctcgtacag tgaggacgtt 20220 

aagcttcagg tcccctagag gagacagtct gctccttcct ggggcagaac ccaaggtggc 20280 

cagagcctgg aaggcaccca gcacccaggc tggtgtgttc cagcccaggc cacacgctca 20340 

gatagctatt aatgccccgt tgagcaattt cctgagagct ttgccaggca ggtaccgcct 20400 

ccccatctga actaatacag gggtacatcc caaggaagaa atgaaaggtg cccacatttt 20460 

gctctgggat taactaggga ggggagtgat aattaactca gtaattatat ttgccatcgg 20520 

gctaatgcta aaattagtgt gcattagaat ttctttcctg agcagacacc ggagtgagtt 20580

gggcagcagg agtggctcgg gcaagtcggc acaaagggca cctccagagc cttccacaaa 20640

tgtcagcaaa acccacaaat gtcaaggccg gctccactgc acccagcaga tgaattcact 20700
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tccacagcct 


gagaccgcca gctcatcgga ggccatttaa aatccagccc tctgacacct 20760

gctggatatc accatttacc gtccccagat caagagatca aagggtggaa cctgatagga 20820

cggctctgaa gttcaccaca aaagcataaa cgtgcaagca gagccaatac gtcttttgaa 20880

aaggacaatg aggtgggaat ttacataact gatcttaaaa tatgttctga tgcttcagag 20940

atggagacag cagcattccg gtacacaaag acactcacag gcagtggagc acagtgaagg 21000

gtctggaatc aggacccagg tgtctgtgga cactacacat aaaagagcag catttacaat 21060

gaatggatag gatggaccat cccaccaagg tgttggacaa ctccctattc actggccaga 21120

cccctacctc ataccatata caaaaaaaaa aaaaaaaaaa aaacccagac agaataatgt 21180

ctgaatgtaa aacataaaac agtaacagtc ctggaagaaa ataatggagg atatatttat 21240

aatctggaga tggagtaaca agggatagga aaaaagccat agggaaaaag tagagttatg 21300

attatatgaa gcttcttaat atctttatga taatgtacca ccagaaacaa ggatgaagga 21360

ctagctacag accagcagtg aaacctgaaa caaacagaac aaagaattaa agtccatacc 21420

aaataaagac ctcccacaaa tctataagaa aaagataaac aggctggcac cgtggcttat 21480

gtctgtaatc ccagcacttt gggaggcgga gatgggtagg tcacttgagg tcaggagttc 21540

gagaccagcc tggccaacat ggtgaaaccc tgtctctacc aaaaatacaa aaattagcca 21600

ggcgtggtgg cgcatgcctg tagtcccagc tacttgggag gctgagccag gagaacagct 21660

ggaacccggg aggcagaggt tgcagtgaac caagatggca atcgcgccac tgcactccag 21720

cctggaggac acagcgagac tctgtctcaa aaaaaaaaaa aaaagaagaa gaagaaaaaa 21780

gaaaagaaaa agacaacaga aaaatgggcc aaggataagt gtaggcaatt tgcagaaaag 21840

taaataccaa taaaccagaa atgagggttg tgcaaatcaa aaggtgttat aatttttaac 21900

caaactggac caaagaaaac accaaaaacc aaaatcttgt aattgccagc atcagagagg 21960

atataggaaa gtgtgtgttc tcgtagatgc ttgcaggtat gaactgctac agccttttag 22020

gagttatgta tgtatgtatg cttgtatgta tgtatttgag acagggtctc gctctgttgc 22080

ccaggctaga tctgttgcag tgctgtgatc atggcttact gcagccttga cctcctgagc 22140

tcaatagatt ttcccacctc agcctttcaa gtagctgaga ctacaggagt gtgcaatcat 22200

actcagctaa ttttttaaat tttttgtaga catggggggt ctcccaattt tgcccaggct 22260

ggtctcgaac tcctggactc aagtgatcct cctgcctcaa cctcccaaag tgctgggatt 22320

acctggatga gccactgtgc ccggcctcaa tatctttaaa aacagaaatg gacacactct 22380

ttgactagga atgtatccta taaaaacact tatacacatg cagagacaca cgagcaagca 22440
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tgctttgtaa tagcaatgaa ggctggaaaa 


actcctcaat caggtaaatg ctgtcaagtg 22500

cacctgtgta ctatgaaatg gcacttggct tttaacaaga gcaaagacag aaaagcaaaa 22560

gtacaaagta gggtgtgatg gcacatgcct gcagtcccag ctactcagga ggctgaggca 22620

ggaagatcct ttgagcccag gagttggagg ccaggagctg ggcaatagtg agaaaaaata 22680

aaattaaata ataataataa taaaataggc tgggcacagc ggctcatgcc tgtaatccca 22740

acactttggg aggctgaggt gggaggatcg cttgatccca ggagttcaag gccagcctgg 22800

gcagcaaagc aagacaccca tctcaacgac aaattttaaa aaatcagcca ggcaggctgg 22860

gcatggtggc tcacgcctgt aatcccagca ctttgggagg ccgaggcagg cagatcactt 22920

gaggtcagga gttcgagacc agcctgggca acgtggcaaa accctgtctc tactaaaaat 22980

acaaaaatta gctgggcatg gtggcagatg cctgtagtcc cagctactga ggcacaagaa 23040

tcgcttgaac cagggtggca gaagttacag tgagccgaga tcgtgccacc gcactccatc 23100

ctgggcgtga gtgagactcc tgtctcaaaa aaaaaaaaaa aaaaaaaaca aggagccagg 23160

cacggtgggg tgagggaggg cacagaagca gcgcctcttc tgggggcacc cccaatctct 23220

agcgatccag aggcctcagg atcctgaagg gagaaaaaac gtgaagctcc gtgctagaag 23280

agaccataga gattggaatc agctggttct attttacaaa aaaaggaaac tgaggccctc 23340

agaaggtgag tgcctctcaa tgccccacag ggaggcaggg agagggctct gagccctgca 23400

gggccctgga ttcttgcaat ggggtggagt ggagcctgtg ccgcccccac caggcacctt 23460

ctcaggagag gagccgttgt catatccttg aaggggtcct tgagcccctc aaaaggctaa 23520

aaaccacttt cctccttgag tgaaccttca cctcagttta accacaagaa aaactacatt 23580

aaggcccagc gcagtggctc atgtctgtaa tcccagcact ttgggaggct gaggtgggtg 23640

gatcgcttga gcccaggagt tcaagaccag cctgggcaac atagtgaaac cctgtctcta 23700

caaaaaacaa caaaatcagc tgggcgtggt ggtgcacacc tgaggtccca actacttgcg 23760

ggctgaggtg agaggattgc ttcagcccag gaggtagagg ctgcagtaag cggtgactga 23820

atcactgcac tccagcctca gcaacagagc aagactcaaa aaaaaaaaaa aaagcaggcc 23880

gggtgtggtg gctcacgcct gtaatcccag caccttggga ggccgagcgg gaggatcagg 23940

agatggagac catcctggct aacacggtga aaccccgtct ctactaaaaa tgcaaaaaat 24000

tagccgggcg tggtggcggg tgcctgtagt tccagctact caggaggctg aggcaggaga 24060

aaggcgtgac cctgggaggt ggagcttgca gtgagctgag atcacaccgc tgcactccag 24120

cctgggcgac agagcaagac tccatctcaa aaaaaaaaaa attaaatctc aaaaaaaatt 24180
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acattaaggc 


aaactaaaag 


atgtttaaaa 


tatatatatt 


aaattaaata 


cactccaata 


24240 


gagcaaatac gaaaataccc 


agaaaacaca 


atccccgcac 


ccccaggaca 


acctcccagg 


24300 


gggtccacag 


caagagaccc 


caagcacgag 


agacagagaa 


cagtgtccct 


gtggcggaac 


24360 


ctctggccca 


tcaggctcta 


ttagaaaata 


aggctcttgc 


cactgagaga 


aagaggcaca 


24420 


gtcgcccagc 


agccacgggc 


tctggcacac 


cacgagtcag 


gccagcaaag 


tgtcaactgc 


24480 


cccctacaag gtgacaaact 


aggacaaact 


ggaaaccaga ggctggacct 


ggagcacagg 


24540 


gaccaccaca 


tggggctggg gaatgggcag ggacctcaga gcgccaccca 


catgcctaag 


24600 


agcagcgcgt 


atgcgcatgc 


ctctgcatgg 


cttagggaca 


cagggagctc 


cccccacccc 


24660 


caacccagga 


aggcagcccc 


cactacccag 


gtagggaacg 


gataggacca 


gcaccccgtt 


24720 


ctgctcgtaa 


ctcagggctc 


caggccccct 


cgggggcaac 


cagcacagag 


ctcagacccc 


24780 


aaatatcttc 


acccacctcc 


tggtccccat 


ctggacaagg gtgctgggga ctggctctca 


24840 


gtcacaccct 


cggggtactc 


ttcaaaggac 


agctggatgc 


cccagggcag gagcttttgg 


24900 


cccccagctc 


cctcacccca gacaccagct 


cttgggaccc 


caccagcatg ggcaaggtgg 


24960 


acaccatcgt 


cccgattttg 


cagatgagga 


aactgaggct 


gagggctggc 


acacggctct 


25020 


ccagagctga 


agagaatgca gagagcagcc 


ggagccagcc 


ggtgggtccc 


tgaggccggc 


25080 


tcgtagcaag 


ccacagctgc 


ctccgcccat 


cacacttgga 


cctcactggc 


cccaggacag 


25140 


ccctccaggg 


cggcctggca 


cagagcccac 


accctgctgc 


ttcctgaaca 


aataagtgaa 


25200 


caaggccacc 


aagccgagga 


cctggatgta 


gccccggctc 


ccgccagggc 


ctccccaaca 


25260 


gactccccat 


ttggagagcg 


cattaagtgt 


ttccaaagcc 


tcacaaacca 


cagatgtccg 


25320 


gctgtctcac 


ggcttctgta 


acctgaactt 


ggccctcact 


ctgccctccc 


agcactcctc 


25380 


tcagggccca 


ggcccctcct 


ctgagatgcc 


agcactgact 


ccccaacttg 


tccccatcac 


25440 


ctggctcgtt cctgaacctc ggcaggagag tctcaggcca gatcctccca ccagccacct 


25500 


ccaccaggat 


gcaggaggca 


tgagacctgc 


tcgtgccggc 


tgggagatgc 


aaccaaccaa 


25560 


gatcaatcca 


atcagcggat 


gaactgacaa 


atataatgtg 


gtccctccac 


acaatggaat 


25620 


attattcagc 


cacaaaaagg 


gctgaaatag 


gccgggcgtg 


atggctcaca 


cctgtaatcc 


25680 


cagcactttg ggaggccgag gccggcagct 


cacttgaggt caggagttca agaccagcct 


25740 


ggccaacatg 


gtgaaatccc 


gtctctacta 


aaaatacaaa 


aattagctgg 


gcgtggtggc 


25800 


gggcacctgt 


aatgcaagct 


acttgggagc 


ctgaggcagg agaatcactt 


aaacccagga 


25860 


ggcagaagtt 


gcagtgagcc 


aagatcgcac 


caccgcactc 


caacctgggc 


aacagagcaa 


25920 
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ataaaaggct gaaacaccca tacgtggtac tacttggatg 


25980 


actcctgaaa 


acy t ud-Ucty u 


aaccaaggaa 


gtcagccacg aagacgcatt 


gtaagattcc 


26040 


CttCaugCaa 


daUy a.y d 


acaggcagaa 


ccacagaggc agaaagtcga 


ctggtgttca 


26100 


CCayyyy alC 


prfrrnrrarrarfn 

cggggagagg 


gaacgggaag 


tcaccgtgta atgggtatgg gttttatttt 


26160 


yyygt-yaugy 




taacttgata 


gaagagaggg ttgtaaacac 


tgtgaatgta 


26220 


ccaaatgcct 


gccttctata 


ctttaatatt 


ttatattata taagtttcac 


ctcaatttaa 


26280 


aaaaaaaaca 


actcgacacc 


tttcacctag gaaagatctg gctttagctt gcatttcctg 


26340 


taactcctgc 


ctaaagcctt 


ccagaagctt 


ccgctgcctt gtggatcaca 


accagactcc 


26400 


acaccatgat ctggcctcta agggcctctc gcaggacacc ccgagggtga aggagcaccc 


26460 


gtgggcccac ctctgcatag 


ctgcaaagct 


tctttccctg tcctcccctc 


tacatgggaa 


26520 


gctctgcccg 


caggggcggg 


gccttatctg 


ccattctatc gcactcaacc 


ctagcacttc 


26580 


actcggtagc 


agacaccaaa 


gcaaaacagc 


aacagcatta taccgggcca 


ggtgcacgtt 


26640 


aactcactga attcatggta ggaaggattc 


tattcccatt ttacaggtga gaaaactgag 


26700 


gcacacaaag 


gtagcatcag 


cttcctaagc 


ctcccagcac aggaagcggc 


caggctggaa 


26760 


tcagaccctg 


ggcgcagggg 


ctctgtccac 


agtgctaact aactactcct 


gcccccgagg 


26820 


gctgcagcgg 


tgagtgagtg 


agtttgtcag 


tggactggat gtccaaggtc 


atacaggaaa 


26880 


aatccagact 


attgtaataa 


cagcctctag 


accggctggg gccagaaaga 


tcgaggacgc 


26940 


tgacacacaa 


ctgcgctcac 


tgcagctctg 


ccagggatgg ggctaaaggt 


ctcacacagg 


27000 


gcagttaggg 


ctccccatag 


cctgggagag 


gaacggggtg agataacaga 


aactaggtat 


27060 


ggtgcccgaa 


gtcaaacagc 


cactgagcat 


gtaaacccag gtgggtctga 


ccccaaaccc 


27120 


ctccaccccc 


atcagccctg 


caacccgtcg 


ctgcaaggga gaaagcaact 


cagaggcctc 


27180 


acctgcctac 


atcccccacc 


cgtgtgtgtg 


agttctacta aatgcctgag 


cagtgacaca 


27240 


gcacggctga 


aattaaacgg gttccaaaaa 


cgacaggaag cacgaagtga 


atctccccag 


27300 


gaaagtgctg 


aacaaatgct 


ggatcgggtt 


caccggcgaa tttcttggaa 


ctgaagaggg 


27360 


gagctaaaca 


cacggggiccc 


tgctttggag 


gggactctct cagggtgctc 


cacacagcac 


27420 


ttggttaacc 


ccactcagcc 


cttctgggct 


ctcccagagg gcccggcctt 


ggccttgggc 


27480 


atctacagga 


ggaacctcca 


gggggagagg 


gggtgcctgg acaggccggc 


cctggaacaa 


27540 


gcacttgggc cccgaggaga gaggactagg gcttgggagc tggggaagtt ctcagcactg 


27600 


ggaccactag 


aacaaagcca 


tttccgtgcg 


ttcacagctt ccaattgcaa 


caggaagcaa 


27660 
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+- /»i ft ft ^ -a -a ^ -3 


4-aaf f a rrppffT 
Lddu Ucty tyy 




OQC t tCGCtCf 


aaatcccracTQ 


catgtatttc 


27720 


a r" 1 a rT"h a a a 

GuQ.^ Uddd 


a ppacrcrcrafca 


taacatcaaa 


accgttctgc 


agaaagattc 


ctccctttcc 


27780 




yyu-u uyy duu 


a rra rati tea 


ctggggctcc 


caggccttgc 


tgcctaatgt 


27840 


t-aaaa^haat**" 1 


a s pt~ pt* a t" t" t" 

ClCl UUUUGUUU 


ttacctcaca 


cacaactgaa 


ctctacagct 


ataattcttt 


27900 


uit^ut ciy y y 


pypf - pcra PiCCri 


catcfaaccrac 


aggcatttga 


ctccagcaac 


atcaccccaa 


27960 


aacgfcgcaca 


daaCCCdoda 


u uyuctct uy cty 


a t* cr 3 3 3 crcr c a 

~3 U^J CL d y y ^> Ci. 


a c o c acr fc c cr o 


cctagaaacc 


28020 




GctUdctdUdy u 




cccct tttcrc 

w w ^» w w 


ctccttgacc 


caggcatttc 


28080 


uy y dctctctcty y 


a n f n r* \~ n 

ciy u»yyu.y u uy 


gec tg tacfcc 


cccagafcac t 


gfccgctgttt 


tgtcttcacc 


28140 


t- f- rr t* t- 1- t- rr n f- 
u uy Lutuyut 


cty u u u u cty a u 


aaaaccccac 


aatgtaaaca 


cgctcctgaa 


agaggcagat 


28200 


^^yyyy y dd 


a pt~crt* ppat - a 
d. u uy .u *w o ct c ct 


aaatc tctacr 


gcttgggtca 


QaaqcacTQacr 


gaegtgaaac 


28260 


-*\ ^ ^ /— » 4~ *^ ^ 
aaacuccaag 


ctcccot- L,y L. 


uu-u-u-uyouyu 


u u u u u a. L.Q.^^ 


t~ p p a a ctp acta 

^> v_» \ur u d y ^ cty d 


ggctgcagcc 


28320 


tgggggatct 


gactacaggg 


wu»ctu-u.u.u.y u» u 


nparrahhra 
y u ct u u ct u u u ct 


part nna aaf 1 
u ct u uyydddu 


a t~ t - paacrcracr 

ci. u> l> >v> cty y y 


28380 


acagctgttt 


gecrxaagga 


fx ft f* f r~i anapa 

y y u* o u dy ctu* ct 


ctcty y y y u u uy 


acra"f~ ppf" ppp 
dy y u u u u u u u 


pcrpi~aaacta 


28440 




a a /"< a ft'fx a rf r 

ad u cty y ciy u> u. 




r*acrcrcact tcr 

w vi wt. w 


cggccgtgcc 


acttggccag 


28500 


CCaCdL UCCa 


/~y a -a a o a /**• a a a 
y dddddu-ddd 


a r , a.rTTr*a.f'a.1 - 
ctuctoy uctuct u 


ppcra a era era a 


t*cra t* 1 - tacfQt 

c* c. ^ wuy y 


agcaagaggc 


28560 


4— /"I ^ v~Y — < — > — « 

CtyCttgaaa 


^^*^^"t^/~ia4" ft ft 

aaccaca uyy 


UCtCtUU UUU.CJ.Cl 


at"haaa a era a 
d u u ddddy d d 


pa t*crt"Cft" acre 

u G u y u y i— ciy 


crtttcaccrac 


28620 


U y U U U ct cty U U 


lll uy cty u u u 


l» u u- uy CL ^> v>> w W 


aaft - cca.cicc. 

Ot Vw* V-y ^ 


cctcraQaaac 


accaaaagtt 


28680 


9 9 ^ u 9 ^ 9 ^ ^ ^ 


guUCCCCCy c 


uuuctuuuuuu 


uuuctuyyy uy 


uy uduciciu uy 


a crcr c a c a acre 


28740 


CtyCCUCCCC 


ft ^ /-fi- ft ft ft f \ ft f% 


r*rra +" i~ nnn^ 
uy ct uu uy yy ct 


v-* d ^— w w 


tcccrcatacrc 


c era c c c crcr crcr 


28800 


a t* iT/t a o a ft +* a 

cl Uyyata^ Ua 


uuL.yyyyciuu^ 


yy yy 


a pcrcrcrapa t"t* 

u^m y y y v_/ w. o l> 


pcraatcCTCCC 
y y y w *■* y ^ » 


aacrcacc uQcr 


28860 


-a 4*" /T "h~ r"* /"r 

C tytCay 


+* ft ft fr ft 4- +- ft ft a 

LCCyClLyyd 


a a rr'nararrr 
dcLUUuctL-ciy u 




apa rrcracTP a cr 

ci.%_> cty y cty u ct y 


cgccaccggc 


28920 


uay y CCy CCC 


ftftft ft ft ft ft fr ft ft 

cgcgcccggy 


C uCcty ddu uu 


uuuuyu uyuu 


a pi - 1" racfrrT 1 
du> u uudyuou 


cr t* p p t c crcr a cr 

y uv->v-r u *w y y ct ^ 


28980 


cacgcggggc 


ggccgcgcgg 


/~< ft ft ft t~ ft ft a a a 

ccy u uyy ddd 


u cty y u u uy uy 


ct ct u uy y u uwu 


p p crcrcrc c a crcr 

L " u yyy v -' ua yy 


29040 


cccgcc fcccg 


cgccccaagt 


ft f* ft ft ft ft 4- ft ft ft 

ccccyuuuyy 


^-ftf^ftfiftfif* ft f* 
u y u u uy y u- u u 


rinru" 1 a r a pot 


yyu.uudyu.yu 


29100 


gggcucggcc 


ft ft ft ft 4~ ft ft ft ft ft 

cggcccccgg 


c u u u. u u y u y y 


err 1 1~ cnrtrfr x ^ ct 
yu uuyyyudy 


cr 1~ era o era p c c 
y uy dy y du u u 


crpcccrcQCCcr 


29160 


cacctggcgg 


agcgggcgcc 


ctcctcgcca 


gcccgggacg 


cagcgtcccc 


ggggagggcc 


29220 


cgggtgggga 


gacaaagggc 


ccgcgcgtgg 


cggggacgcc 


ggggacggca 


gggggatccc 


29280 


gggcgcgcgc 


cccaactcgc 


tcccaactcg 


ccaagtcgct 


tccgagacgg 


cggcggcgcc 


29340 


cgcgcacttg 


gccgcggggc 


cgcccgggcc 


attgtccgag 


caacccgcgg 


cccgtcttac 


29400 
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acgccgggcg cgggaaggta tcgaatcagg 29430 

<210> 8 

<211> 33769 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 

<222> (33739), (33749), (33758) 

<223> The identities of the nucleotides at the above locations are unknown. 



<400> 8 



cttcccctta 


cactggtcct 


tcgacccgcc 


tcggatgaaa 


actgaatggg 


tttagcctta 


60 


gaggctctcg 


gtctctaagg 


gaggtgggtc 


aggatgccgg 


ggacagggtc 


ctcttcctgg 


120 


ggcaacgtgg 


gggaacgagc 


cacctacccc 


tccactgaat 


tgccctgggg 


tgtgggtacc 


180 


gacggctcat 


tcggtgtcca 


gggtctgaga 


tgtgttgaca 


ggaagaatga 


aaggggatgg 


240 


gagggatggg 


gcgaaagaag 


ccacctgcag 


ccccaggaac 


tatctggcca 


gcacaccgtc 


300 


acccagcggc 


ctgagccacc 


cctgccagag 


ccaggaggag 


accctgccaa 


tgggtcacca 


360 


gtgtgcagga 


actcagaagg 


tcatcacagt 


taataccctc 


catgccccaa 


tgtgggaaaa 


420 


caggtttttt 


cacaacaaac 


aagataattt 


ttgttatttt 


ggcaaaagga 


ggcagggcag 


480 


ccccggacac 


ctccatccca 


cctcatcacc 


cagccgcagg 


gccccggcca 


tccctgcaga 


540 


cagagtggat 


gtcacaacct 


ccctgcaccg 


aaccaagtgc 


agctcccagg 


ccacaggcca 


600 


cccaggaaag 


gtccagtggc 


ccccggaggc 


tcccaccgca 


ggcctcccac 


cacagccggc 


660 


accaacccag 


gatagctgtg 


ttctcctggc 


ttcttttcac 


acgggtagca 


gaaagctgag 


720 


atccggggaa 


agctgagatc 


cagggaaagc 


tgagaatcgg 


cctctgctgc 


ccggacgccc 


780 


acccccagct 


ctgctcccag 


ctccagggcc 


tccttctcag 


gtgcccttac 


aggaggcaga 


840 


gggcttgagc 


cacctcctgg 


gcctggggca 


cgcaggatga 


acggggtcac 


ggtgcaggcc 


900 


actgtccact 


gcgcagatcc 


caaggccata 


aacagcctgg 


ccacagtggc 


ttcccagctg 


960 
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n r* a n n p n rr p p 
y tdyy tyy tt 


saai-tatttt 

CLM CI I— v_ CI. L» <— 


tgttgtttag 


caattgatta 


agtttctccg 


ctgcccccag 


1020 


y yy L.o.«.y ^*yy 


t* ct csaac aa a t 


gccgcaaccg 


cagcatttga 


cccgggatcc 


tgtgccaagt 


1080 


rr a p^a h a c"tp/p; 
yaLtaLayyy 


Otata-a-ayo-ci 


caaaoa aa.cj t 

\* w*. i3 _3 i»vwt.^ 


aactaaaccc 


gatgctggct 


ctgctggaac 


1140 


CL.ya.yyC-.cyy 


r» p 3 p t* nt" far 
t CclL. uy Luau 




cctcrcfaacct 


tccagcaagc 


acagagaagc 


1200 


f- +- 

tatggccccc 


cdyy dy tdy t 


t~ era p 3 cr cr p ?3 p 


pi~ fccraccfccrc 


a a t c a cr cr cr a c 


fcctgtctgct 


1260 


c age t eta. a. a. 


acaggaaay u 


r'rrr^t - rrpf - p t~ p 
Lyt Lyu LULy 


p p t" nnnnt" p a 
^v^uyyyy i— •>_» ci 


cr era p a cr c c aa 
yyy^^y^^ay 


acraatcracca 


1320 


?3 /T 4— -*1 ft 4— ft ft ft 


yyooLt cty y a 


aerefcra PPt~CTP 




cttccfcctcc 


catccctcgg 


1380 


4— /^i i i ft f* ft a /-f 

cy utay c tdy 


1 1 tt uv^tuy L. 


crcrr 1 p r 1 p p p a f 


tacctacctc 


tgcccccatg 


ccccaccaca 


1440 


acctcaggcc 


cacgyc uyca. 


Lyy LwaL. L.tt 


u^dyy *»< cty y *— • 


a cr t~ cr crcr cr a t a 


acrat fc tcacc 


1500 


acy c cyy CCS 


ft ft ft 4- ft 4-~ <*i +— p 

ggcxggucxc 


rTaarfrPft" rra 
y aa^ctt uy ct 


ppf p a rrrrt" na 
^ l> o ciy y l» y ct 


crcra a t" t" p p t" a 


a acr t a c t crcrcf 


1560 


attacaggcg 


tgagccaccg 


ft a ft ft f ft~ 

cgccciy LLL L. 


tot l. y Ly y l.cl 


t L-ctctcitctt to 


?i H a C* C C CCtifc. 


1620 


gctggggacc 


ctggtgaggg 


aaCaCayCCt. 


papa arfhrra a 
tdtddy L.ydd 


rrh pf t- crcri - Y tt 
ytyL.yyL-L.oL. 


cr t" 1 - era ctp a a a 

y L- Ly ciy L»aa.w 


1680 


tgacgcctgg 


ycaycccttt 


pafrhht* np P 
tO.L^ L. L> L.y 


"h a a a a p t~ era a 


era a t* t* t* acrcrcr 


crca t cr crater t 

3^3 ^33 u ^3 


1740 


a"h*aaaapa /"ft- 

aUoaaaCayU 


4- /-f/-y4-pf o pt* "h a 
Lyy u.y ctt u tct 


a a t~ <r~f a aaaao 


3 serein P3 par 


t ccccc c ttt 


acr cr c aQQ c Q Q 


1800 


z—t /-i +— ^ ^ 4~~ 4~~ r - * +* 4** 

cc taatuctc 


±* *~\ T\ T\ ft ft /t ~3 ft 

taaaagc cay 


Ccicciyyy L.yu 


pt~ 1~ t* pt"oa a p 


ppacrcrpacac 


aatacrcftatt 


1860 


caatggacag 


c agcy y t t at 


L L-y Lett uy t l, 


pafrrpparrp 

Cl L» y CL<v w V_> 


t" cr t~ p t* crt" cjcrp 


ctctgcagct 


1920 


ggctccagcc 


tgacgcatgg 


c ty cy tttt l. 


1 1 y t ctcty y 1 1 


a ppppp/crt* a t" 
ctttttyy L. ci w 


apatcraaaac 

Ci w Cl t*cict\rf 


1980 


c c tg uggagci 


aggccttggg 


ggccggccag 


ywtyttayyt 


rraaatccca 


tctgcgccct 


2040 


tcctccatag 


acctcagcga 


gctctcggca 


p^af nhrrpp'h 

ttduy uytL l. 


fi a rrrrp p p a i~ f" 

yy l. 


f-aa rra a crt" a cr 


2100 


ggccggccag 


gcatggtggc 


tcatgcctgt 


aatrrraara 
aauuuuciy La 


p 1" t~ 1" crcr era crcr 
ttt L.y yy ctyy 


p p c a acrcr t cro 


2160 


gcggatcacg 


agatggtcag gagatcgaga 


p pa. t" p p t" aac 


faara p crcr tier 


aaaccccatc 


2220 


tctactaaaa 


atacaaaaaa 


taagccgagt 


/^v 4~ ftft^ fTft flfTfT 

gtggcggcgg 


y ty ct Cd tdy 


tttctcty t L.cit 


2280 


tcgggaggct 


gaggcaggat 


aatcgcttga 


yctcaycayy 


tdy dyy t L-y t 


ant" 1~ acrpcfcra 
cty l. LctyL-yya 


2340 


gatcgcgcca 


ttgcactcca 


gcctaggtga 


cagagagaga 


tL.tL.yL.tL.td 


Ctt UCLCLCLGLCLCLCV 


2400 


aaaaaaataa 


aaaaaagaag 


cagggccagc 


ft *a ft ft ft a p ft a p 

Cdcgy dtydc 


pppi+~pa papa 
tttL.tdtdtd 


p;p t - p p p a nrra 
yt L-tttciyyci 


2460 


cgcgtgcctg 


ggtatagggc 


tcaggaccat 


gaccgctgca 


gtggccccca 


agaaacgtta 


2520 


cttttgtcac 


ccaccccgcc 


tcagtggcag 


tagccaaaat 


aacggattag 


aatggaacca 


2580 


tgtgacaatg 


ccactgcccc 


aactgacaga 


agatggctat 


cagcagttca 


cgcggcccca 


2640 


cctatcacaa 


gtgcagggca 


ctctacaact 


tatgcatcct 


tccccagaca 


ccgtcctttc 


2700 
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gaccctccca 


ggtcagcaag 


gcacacaggg 


cctacatttc 


acagccacac 


agcagagggc 


2760 


tgaggctgga 


actcggatgc 


tctgatttcc 


gttcaatcac 


atccccagag 


gtggcacaga 


2820 




ttctcttaac 


aaagtcaaga 


aagtcactgc 


cagctccact 


gaagaccaaa 


2880 


aaarri - pa err* 


fct*caaaccc 


tcttgaaggt 


gttaccgaac 


tctcccagcc 


tgtttcctgg 


2940 


y U. W W t y CI y 


t" era t* p p p a t q 


cfGacacaGcra 


agaggaagaa 


gctccctaga 


gcagagcctg 


3000 


p/l - (™f p 3 p p 1~ rrp 
y uyoGUL. uyu 


raria pt" p t~ pa 


era aaac t" csco 
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agtctcgact 


26940 


caaaagtcac 


caagaggcct 


tcccagcacc 


tgagctccga 


cggaagcccc 


tcgccacagc 


27000 


acccaagcac 


tgctttatcc 


ccctacgcac 


acgtcccttt 


caaatactat 


tcatttacca 


27060 
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fccfcccfcccca 


ctcactgaaa gggccagaga ctgggctata cccgctgcgt ggggagcagg 


27120 


accaggcgca 


agggctcaca aatgcagtgg atgcctggtt gggaggtgag ggagctgcag 


27180 


cgacccacgc 


tgggagggaa cgcaatgaca ggaggagcgc aggtcctggc gacacgatgg 


27240 


ccatggcagc 


cgctggtgag caaccgcagg ccggccctgg gagagggctt ctagcaagct 


27300 


gctatcttca gcctctccga ctactgcaga tgccccctcc tagccagaga cactgctaca 


27360 


ccagccgacc 


cttccaaaaa gaaggtcagt aaccccgcga ctcctggagc cacagtgcag 


27420 


ggggagaggg 


ctgagagggc aacagttcac caagcggaac agaggctgcc ccggaggtca 


27480 


gctggctccc cggcagctgc aggggtggct agcccactcg gagggcagcg agggcatacg 


27540 


aggggctcca gggatgagtg gttgcccagc acagcacccc tgggaggccg ggggcacttc 


27600 


\~ r> a nni" a crt" cr 


ggggcacgag gctgctctgg cctgacctca gggactcaaa atactttggc 


27660 


y <w acia w 


accgtgtccc acccctgctg gtaccccata cttacacaca gactggttca 


27720 


a pi t* ar 3 era r a. 


ctctcgcgca catactcgct cacacgggca catacacgtg cacacacagt 


27780 




cactcataca cacacaaata tccactcaca cgcatgcatg cacacacacg 


27840 


**** ^ ^ ^ 

gacacaccicci 


rtrsnma rarer tatacacaca tatgcgtgca cacgcacaca cacacacaca 


27900 




ctcccactcc cacactcagt tgctcagaca cacacacgcc tggctctcac 


27960 




tgggctctga aaggctccag cccttcccat gctcgtcaga agccagtcaa 


28020 


tggcttccta 


agtcaccaca cagatcaaag aggtgaactt ggccacacgg caccctyt-uu 


28080 


cctgagctcc 


caaacaccag ccttggtgag gacagaccct caccccacac cctcattccc 


28140 


actaccctgg 


gcaggcccag aggaggggca tctgcaggat ctggcaacca gcccctcccg 


28200 


cccggctcct 


gcagccggca ccatgggagt cagggggagg tcactgcaaa gggcaacagc 


28260 


aagttggtgg 


ccccaggact agagcccagg ggtcttcagt cctactccag agcttggaca 


28320 


ctgtcccaca 


gggcatggcc aagggaaggg cttccagagc cctgacttca gggaggaggg 


28380 


caggcgggct 


cctgtggcag gcctggatgc atggccgccc actcctggga ctttctaacc 


28440 


tagaatatct 


aggtcaggct gggtgcagtg gctcacgcct gcaatcccaa cactttggga 


28500 


ggccgaggag 


ggtggatcac ttgaggttag gagtttgaga ccagcctggc caacatggcg 


28560 


aaaccctgtg 


tctactaaaa atacaaaacc tagccaggtg tggtagtgca cgcctgtaat 


28620 



cacagctact 


caggaggctg aggcaggaga atcacttgaa ctcgggaggt ggaggttgca 


28680 


gtgagctgag 


atcgtgccat tgcgcaaaga agatctaggc cggcccctca accggtgagg 


28740 


tccaggctgg 


gagtgctgag agactgtggt gacactgaat gaactaacag gcaaagggct 


28800 
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

tccaactgag cctgggggtg gtgggaaatg gctcttgtgt tctagtcaag acctctgcca 28860 

accagttctg acactgaccc agcacagaac ctgacaggtc agcaagggcc agggcttagc 28920 

acagcccagg taagggtgtg tgtacggccc ccagagtcac tcccaggctg caagaaaagg 28980 

gacaaaggag ggacaagggg tggccaagca aactgttccc tctgctcggg agtctggggt 29040 

gacctggcct agctggccag tggagctggg ccacctcccc ttaaactctc caccccggac 29100 

ttcgactcca aagctttcct gccacccacg ctctccccac ctgggatcac ggccaggccc 29160 

tgagccttca agggcccagg tgaactcagc cagactagga gctgaggagg acacagggca 29220 

gcttccagaa cggacccgag aaccactccc agcaggttct gcttccagac aaggagctgc 29280 

actttttcag ccaatgcaat tagaaagcca ggagaaggtg caaattccac ctgcctgagc 29340 

gtccgcactt cccaggccgc ccaccataca cacagcaaag atgtgtttaa ccattcaaac 29400 

ccatggccaa ccacatcggt tgcctcagac atgcaagttt taaaaaggaa cataactatg 29460 

ggccaggcac ggtggttcac gtctgtaatc ccagcacttt gggaggccga ggtgggtgga 29520 

tcacctgagg tcaggagttc gagaccagcc tagacaccat ggtgaaaccc catctgtacc 29580 

aaaactacaa aaattagctg ggcgtggtgg tgggcgcctg taatcccagc tacttgggaa 29640 
gctgaggcag gagaatcact tgaacccggg aggcgaaggt tgcagtgagc cgagattgtg 29700 

ccactgcact ccagcctggg caacaaggga gactccatct caattaaaaa aaaaaaaaaa 29760 

aaaaaggaac ataactatgg agtctcaagg ggaagtaatt ccttcaacaa taacaaatct 29820 

tgaaagctga gctctttttt ttttttgaga caggatctcc tcactttgtc gcccaggctg 29880 

gagtgcagtg gtgggatcac agctcactgc agcctcgatc tcccaggctc aaatgatcct 29940 

cctacctcag cctcccaaga agctgggatt acaggtgcat accatcacac ccgattcatt 30000 

tttgtatact ttgaagagat ggggtctcac catgttgccc agtgtggtct tgaattcctg 30060 

gactcaggtg atctgcccgc cttggcctcc cagagtgctg ggattacagg cctgagccaa 30120 

cacccccacg ggttcatttt cagagtcgca ccgagtgctg gggttacagg cctgagccaa 30180 

cccccccacg ggttcatttt aagagtgaca ccgagtgctg gggttacagg cctgaaccaa 30240 

cccccccacg agttcatttt cagagtcgca ccgagtgctg gggttacagg cctgagccaa 30300 

cccccccacg ggttcatttt aagagtgaca ccgagtgctg gggttacagg cctgagccaa 30360 

cacccccacg ggttcatttt cagagtcaca ccgagtgctg gggttacagg cctgagccaa 30420 

cccccccacg ggttcatttt cagagtcaca ccctttttct gaaaaacaac ttgggctcat 30480 

gcaaattcga gagagagatg gtgacactcc ccgccccctg gacccaggtg gagtcgcagc 30540 
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agaagcctca 
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aacggtctca 
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gggctcaatg 
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"5 *1 O C A 

31860 


agaggccttc 


agaccatcac 


cccaccagac 


acctcagacg 


tcggaccagg 


agggtccagc 


31920 


aacccccaac 


acagactcag 


ayy y ac uaay 


aagccacacg 


aggagugaac 


acaagaugug 
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ctcc ayy y ag 
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CtCCauCay t 


i^n r"T 4— /^f 4- 4— 4— /~r 

CCy tytLCty 
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O ^ U -a U 


ggttaggctg 


ggctggccac 


aaacaccccc 


aaaaaacatc 


tgaagccttg 


gcttgaaaca 


32100 


gctgacattc 


ctcatgaaaa 


ctgcagaccc 


ctgggtcctc 


ctgcgcagat 


gggggagccc 


32160 


agccaacccc 


acactcccac 


cttcaccaag 


aaagagaaag 


ccaaaacaaa 


ctcaactcag 


32220 


ccaatgacaa 


tcacagaact 


gaatcctgta 


gttagttcag 


ttggtttcat 


ttcagcaggg 


32280 
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gaaagatttg cagcctctat gagggtagct 


gggaacacaa 


agggccagag 


catggcccag 


32340 


gagaccccag cgcagtgggg tagatggttc cgagcacagg 


cctccctgcc 


aagacaagca 


32400 


ctggctcaaa tcctggcccc tcccattccc 


aggagacatg 


ctccacagga 


tgggaggaca 


32460 


cacagaggac ctgaggccag gaaaatgaca gcggcgcctc cgccgcccca 


cccgtgctgt 


32520 


catcatctta aatctacacft tctttgtggc 


aacgagggac 


actgtgaaag 


tcaaacaaca 


32580 


ggaaggcata ggccacaaat aaagacaaac 


gggacttcat 


gggaagctaa 


agattttgtg 


32640 


catcaaaaga cactatcgag agagtaaaaa 


ggcaacccac 


agaatgagag 


aaaatatttc 


32700 


caaatcatag atctactaag agattaatat 


ccatgaaata 


cagagaactc 


ctaaaactca 


32760 


acaatgagaa aacaactaag ccaactcaaa 


aatgggcaaa 


caacttgaac 


agacatttct 


32820 


ccaaagatga catataaatg gccaataaac 


acatcaaaac 


aggcttaata 


tatccctaat 


32880 


catcagggaa atgcaaatca aaactacaat 


aagataccat 


cttgcaccaa 


ttaggacggc 


32940 


tactatcaaa aaaacaaaat agcaagtgtt 


ggtgaggatc 


tggagcaact 


ggaacccttg 


33000 


tgcaccactg gcaaaaatgt gaaatggtgc agctactatg gaaaacagca tggcagttcc 


33060 


ccaaaaactt aaacacagaa ttaccatatg 


acccagcaat 


ttcgctttgg 


gttatatacc 


33120 


caaaagaact gaaaacaggg acacaatcag atatgcatac 


accttggatc 


acagcagcat 


33180 


ccttcccaac agctaaaaca tggaggcagc 


caggcatggt 


ggctcacgcc 


tgtaatccca 


33240 


gcactttggg aggctgaggc gggtggatca cctgaggtca ggagttcgag accagcctgg 


33300 


ccaacatggt gaaaccccgt ctctactaaa 


atacaaaaat 


tagctgggcg 


tagtgacggg 


33360 


cacctgtaat cccagctact cacaagtctg 


aggcaggaga 


atcacttgaa 


ccctggaagt 


33420 


ggacgttgca gtgagccaag attgcgccac tgcattccag 


cctgggtgac 


acagcgagac 


33480 


tctgtctcaa aaaacagcaa aacaaaaaca 


aaaaaacaaa 


caaacatgga 


agcaacccaa 


33540 


gcgtccctct actgagggat gaatagcggg gcaaaatctg 


ctccatccac 


acaatggagt 


33600 


actattcagt ctcaaaaagg aaaaagattc 


tggtcaggca 


cggtggctca 


tgcctgtaat 


33660 


cccagcactt ggggaggctg aggcgggtgg atcacctgaa gtcaggaatt 


caaggcccgc 


33720 


ctggccaaga ctggcaccna gctacacana 


aagtatangg 


ccccggaaa 




33769 



<210> 9 
<211> 72049 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> unsure 

<222> (8356), (8385), (38585) 

<223> The identities of the nucleotides at the above locations are unknown. 



<400> 9 

tataccttgc gcggaccttc ggctcctgtg gtgaagacaa tatgaagaaa atagaaatta 60 

cccataattt tgccacacag acttagttgt gtccatgtat cttgtgcacc ttttttctgt 120 

ttacggatca aaatcgactt ttagggtcag gcgcggtggc tcacacctgt aatcccaaca 180 

ctttgggagg ctggagttgg ggttgggggg tggatcactg aagatcagga gtttgagacc 240 

agcctggcca acatggcgaa actccatctc tactaaaaat aaaagattag ccaggcgtgg 300 

tggtgggtgc ctctaatccc agctactccg gaggctgagg caggagaatc gcttgaaccc 360 

aggagacaga ggttgcagtg agccaggatc acgccactgc actccagcct ggcaacagag 420 

cgagactctg tctcaaaaaa aaaaataaaa ataaaataaa taaatacata aattgacttt 480 

taggagattg gttcaaacaa tgtgtgtaat gttgtgtctg agtgtttttc atttatcgtt 540 

catgcaaatt ccgacatcat tcactcttct ccagagtgtg ctgttttcct gcctgtgtca 600 

tcacccgtca ccttgaatgc cctcgtttag gtaaaataag tacattttat tcaaaaatat 660 

ttgaggacat ttgggttgtc tccaggttct tggtcttgag ttttgctgtt cttgtggagc 720 

catggtggtg tctggttgca ggaacctcca tgcgttccag ctgctgcttc tgcctgtgtt 780 

cttagagagg aaatgctggg gtccgcggtt cccgggctgc tgaccaggaa gcctgcggtg 840 

ctttacggcc cttccagaag cgggagatgc ccccacttaa gtgtcagaca ggcctttcca 900 

cctcactggc agctctgagc ggctcccttc tatttgcaga tgactgagaa gttaccaatt 960 

tccacgttta ctgactgctg tttctcctgt taatttgtat ttatagtctt cgctaattta 1020 

ttgctagggt tttggtgttg tccctattga cttgtatgcc ttttaatttt ttaaacaaca 1080 

ttaatatact tcattttttt agagcagttt taagtttaca ggaaaattaa gggacaagta 1140 

cagagagttc cttccacctg ctgtcctcct ctcctcctcc ccaccttccc tccttcccct 1200 

attgtaactt tctttctgat attataaaag tcactcatgg ctgggcgtgg tggctcacgc 1260 
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ctgtaatccc agcacgttgg gaggcagagg caggcagatc acctgaggtc aggagttcca 1320 

gaccagcctg gccaacatgg tgaaaccccg tctctactaa aaacacaaaa agttagccag 1380 

gcgtggtggc gggcacctgt aatcccagct actcaggagg ctgaggcagg agaatggcgt 1440 

gaacctggga ggcagaggtt acagtgagtc gagatcgcgc cactgcactc cagcctgggc 1500 

aataagagtg aagcttcgtc tcaaaaacaa agtcacacac gcttcttgta cgagggtcat 1560 

ttggccgagg ggccagatgg ctcaccatct agttgggaca ggccatgagc tcggaatgct 1620 

ttttacatat ttacatggtt gagaagaaaa tcaggagaat aatgttttgg gacatgggaa 1680 

aatgacatgg aatttgcatt ttagtgtcca taaatgaagt tttgtttgct cccagctgtg 1740 

ttgactgagg caggctggct tcctacagct gcggcagagc tgaggaggcg ggaaggagac 1800 

cgtgcaggcc gcagcaccga aaatatttgc tctctggccc ttcccagagt gcttgccgac 1860 

ctctgtccga cagctagaag gaaggatagg acccgtccga cgataaccac tgttgacatt 1920 

tgagcgcgtt tccttcccgg cttttgtgtg agagtggcag tctgtttgct tttgtggtcg 1980 

ggatctgctg cacgcacggc gggctgtttg catgaggctt cctggaggat agggctgggc 2040 

tcggagctgc acgcagtggg gcgtgtcctg catgcagtgg ggcctcagaa gagagctgtg 2100 

gtgggcgggg cagtgccaac gctggtgggt gccaggcctc cacgctcaga tcagccccgg 2160 

cgacaggttt gggccaccct ctctctggcc tctgtgcagt ggcccaggcc gtctgctctg 2220 

cctggcacac ttgcctctgt ccttccactg aagcgctcct cttaccctct gctcccggct 2280 

gggtacgttg aattgtgtcc ctcaaggaga tatgctaaag gtctaacccc aggaacctgt 2340 

gtatgtgatc taatttggaa acagggtctt ggctgatgta atcaagcgag gatgaggtca 2400 

ccctagagta gggggcctat atccacggtg ctggtgtcct catgagagca ggtgagcaga 2460 

cactgacact caggggtgaa ggctgcatgg agtcagaaca gggcttagtg cgatggcggc 2520 

cacaagccaa ggaactccaa gtatttcctg caacaccaga agctggaaga tgccaggaag 2580 

gatcctgccc tggagccttc ggagggagtc tgtccctgca gacgtcttga cttttgattg 2640 

cagggatgca tgtcttaggg tgtgtggggg ggtgcatttc tgatgttaga agccacctgg 2700 

ttggtggcga tgtgtcacgg gagccctctg caggttctgc gtgtccatgt ggtcggggac 2760 

agaggtgggc agggacggac ggtgtcgagc tggacatgtc catgacgtcg gccatccctt 2820 

gggatggctt ttttgttttg aggataaggc tgcctgccag gaagctgtgc cctgcctggc 2880 

ccttgcccca agcccctggc ctgtgcttgg cctcgcggaa gggatgtcgc ccttctctcc 2940 

tgcatgcgtg cagggaggaa ggggagaggt cagcagcccg cctggaggag gctcgggcga 3000 
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

ggggaaggtt tcactttcag gcaatgttgt ggggctgttt aaacaacccc aaagaaaacc 3060 

atttggccaa actgttagtt tccaaacatt ttacttcctt ggtgtttaaa taaattccta 3120 

ccaagactct gtagctggtc ccagggaagg agttggcctc tcttctttat agcccggcac 3180 

agtcagtccc ctgcacctgc ccctcccaac cccaggcctg cttccccgtg gccatggctg 3240 

ctgcccggac ctctctacac acagaacctc ctggaggcca gctgtgggca ccagccttgg 3300 

cagggctgtg gcggagccca ggctgctggt actctctctg cagctgctcc ctgctggcct 3360 

ggctggacag cgtccccacc accactgggg tcacctctgt gctggtcaca gctcactcag 3420 

accttcaggc aaatgggttg gatcctgcct ctctcccagg tgtctcagtc tctgcaaaac 3480 

tcaaaaacct cagaggcctt gcagcctgag gggtgtcaga gacacctcct tcgaatcagt 3540 

aaacacctac agattcaccc cagcagtgaa aggactgctt cgccacagag gtttgattta 3600 

ctcctaagta attggaaggg atgccgagaa taggttcctc atggtgggac tagaggccct 3660 

ctgctgacct agttaacaga gggctagggc tgggtgtgct cagcccctga aggttctagg 3720 

cccatttggg acaccccgcc agaacctgcc acaacctgcc atgtggtgac agctacctaa 3780 

atcccagagg ctcttgagct ggagagcaga cctctcaatc tcagcaggcc ccccacacag 3840 
accccataac cctagtctgc cttcacagta cagttcgtgg ctatgtgttc acggatggtg 3900 
ttgttcacct aaggtctctg ccctgtgacc ccaagggcgt cctgagggca gattccaagt 3960 

ctgtttcgtc cacccctcct tccctagcag cgggtccagg gcctggcctg aactagcttc 4020 

ccacagagat actggtggga tgatgaaggc agccaggcgg caagtgaaaa acgcacttcc 4080 

tgcatgtgct ggctcctggg attgaagtgt ttgaggaagc aaagtgaagt gagctttcct 4140 

cttgcggctg tgtgtccttg ggccgggagc ctaccctctc tgagcgttgg ggtccttgtc 4200 

agtagaatgg ggcatcctca tagctcaagg ggtggtgtgt gaaaattgtg ctattgtgtt 4260 

actttaatga tttttttttt ttcgagacaa agtctcaccc caacgcgcag gctggagtgc 4320 

agtggcgcga tctcagctca ttgcaacctc tgcctcctgg gttcaagtga ttctcctgcc 4380 

tcagcctccc aagtagctgg aattacagga gtgcgccacc aggcccggca tatttttcta 4440 
tttttagtag agagggggtt ttaccatgtt ggctaggctg gtcttgaact cctgacctca 4500 
ggtgatccac ctgcctcggc ctcccaaagt gctgggatta caagcatgag ccaccgcgcc 4560 
cggcctactt tagtgatttc ttaggaggac agagggaacg ggctggcaag acaggcttgg 4620 
aatgtgtttt gggatcaagt gccggtttct gtctggcact ggcgttctct gtggggccat 4680 
gatggacaca ctgctgaggt caagcgtgat tcgtcttgcg ctgtgcctgg cagtctcatt 4740 
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ggaaagttct gtagacatcg tgtggatggg gctcttcccg gccaagccct tggggacctt 4800 

ccaggactgt gatctcccca cagtggctgt taagcaggga cctttcgtga agtggagtct 4860 
ctggtcccct ccaagtcata gctagacagg gactcgggca tcgccaagcc tggctgatta 4920 
ttcactggat gaggagacag gcccagagag gggcaggaac ctgcccgagg tcacccagca 4980 

ggccccagag gtttcggtct cggattctcc ctgctcatcc ctggatgtag tgctgctgtg 5040 

gatgtggttc tgtgctgggg gctgtggaga gcagggggct tgtgccagga ccccagtgag 5100 

ggtggcgccc tcgccatgag gccgactgtt ggtatggggc ggccatccac tggggtgtgg 5160 

ggaggaacag ctttcctgag gaggaggtgg cgggaggaac agcttccctg aggaggaggt 5220 

ggcggtgctg tgtgacctgg gccttgaagg acaggtccat tgtcaacaga acattttggg 5280 

agtggagcct agagggagaa aatttgttga aattcagatt cccctccccc taccaataca 5340 

caccaaatca gatgcccctg accagatcta aatttggctc tcagagattt ccattgtagc 5400 

tgggcacttg gggaaccttc taagtgctgc ctctgcctct ccccagcctg cctgcctcag 5460 

tttccccagc cctgggcccg tgtcgctgtt gccatcacgt gggcgccctc tagtggagga 5520 

atcagattat gcactccggg gcttggagca ggagtcagga ggggctcctg tctttccttg 5580 

aaacgttgga tgccgggatc ctggaacagt ctctgcattc ctcctggcga gaaccagagc 5640 

ctgggcacag gggaccatct gttgtttgaa ggctgcagcc tggcagggca ctcaggagat 5700 

ctggcagttg gctgcagggc caggtctagg ggccagggca tcagggaggc tctgggctgg 5760 

ttcagccccg ggcccctttg cagattgtga cctgggcccc tgtgcagggg catggccaca 5820 

ggatgctggg aggggtctct gaccctgacc ttcttggctc tgtgcatcct tgagaccaga 5880 

aaggtctgga acaaatgagt agacgatgcc ctaacctggg gagggagcca catcctgatc 5940 

ccagcaacct cgggaaggat ctgtcaggat tatggggcac cctgggggcc ccaagtctgc 6000 

atgggtctcc acttgcaatt tctgtaggaa gctctgataa atccaaactg ggggtcctag 6060 

gacacagtca gaaatgctga taccgttgtg tgtggagcct cgggccctgg gggtcaggag 6120 

catgtggagg gtgggccacg ggggttcaga agagaatcct gtaacccccc accccccaaa 6180 

ctgaagccca cttgagggcc atggctgaaa ggttgggggg tctccgtgcg tcctgtggag 6240 

tgggtggtga ggagtccttg ggtttgcacg cctctgggcc tgagcggcgg gaccccgtcc 6300 

acagcggatc cctgggccct gttgctcaga tgctctcaga gtgttgctgt ggccacggag 6360 

ggagcctgag ttaagcttct cttgtgccgg ttgtacgctg tcaggtcaca ctggtgagtt 6420 
aggcagggca cagatgccca gagcagaggg aactttcctt ggggattcaa cacgtgcaag 6480 
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■tcttaggggc tggcaaatcc tgccctcagc tagagagggg gcttttattt gacjaccagaa 
tcacctgagc atcctcctgt ccccagctgt gtccagcctg tctgcaggga catcctgaga 
ggaccaggct ctcccctcat ccacctgcct aagtgccact ctgaaccctg tccacctgtg 6660 
ccgtggaggg gcgtgacctc aagctgctca gccagcagca ggcttggccc tggggggcag 6720 
cagagaccca ggtggctgtg gggtgggtgc ttcgtggcgt ggttctgaaa cttcgttgga 6780 
agtgtgtgga cagtgccttg cctgttctct gtgggaccct atttagaaac gaggtctgag 6840 
ttactggggg tcatcactgt gttctgatgg cccagctgtg tggaggccgc ggtgcagccc 6900 
catccaagga gccagggccc tgggtctagc cgtgaccaga atgcatgccc cggaggtgtt 6960
tctcatctcg cacctgtgtt gcctggtgtg tcaagtggtc gtgaaactct gtgttagctc 7020 
ttggtgttcc tgaaagtgcc cccgggtctc aggcctcaga accagggttt cccttcatct 7080 
cggtggcctg ggagcatctg ggcagttgag caaagagggc gattcacttg aaggatgtgt 7140 
ctggccctgc ctaggagccc cccggcacgg tgctggggcc tgaagctgcc ctcgggtggt 7200 
ggagaggagg gagcgatgaa gtggcgtcga gctgggcagg aagggtgagc ccctgcaagg 
tgggcatgct ggggacgctg agcagcatgg ccagcagctg ggtctgcagc ctggtacccg 
gcgggacttg tggttggggc tggtttgtgg ccaggagagg ggctggcagg agacaagggg 
gactgtgagg cagctcccac ccagcagctg aagcccaatg gcctggctgt gtggctctca 7440 
gctgcgtgca taacctctca gtgcttcagt tctctcattt gtaaaatgag gaaacaaaca 7500 
gtgccagcct cccagaggtg tcatgaggat gaacgagtga ccatgtagca tgggctgggt 7560
gcgtgtcacc taacatcacc agcctttgca aggagagccc tgggggcctg gctgagtatt 
tcccttgccc ggcccacccc aggcctagac ttgtgcctgc tgcaggccct tgacccctga 
ccccattgca cctgtctcca caggagccga ggaggtgctg ctgctggccc ggcggacgga 
cctacggagg atctcgctgg acacgccgga cttcaccgac atcgtgctgc aggtggacga 7800
catccggcac gccattgcca tcgactacga cccgctagag ggctatgtct actggacaga 7860 
tgacgaggtg cgggccatcc gcagggcgta cctggacggg tctggggcgc agacgctggt 
caacaccgag atcaacgacc ccgatggcat cgcggtcgac tgggtggccc gaaacctcta 
ctggaccgac acgggcacgg accgcatcga ggtgacgcgc ctcaacggca cctcccgcaa 
gatcctggtg tcggaggacc tggacgagcc ccgagccatc gcactgcacc ccgtgatggg 8100
gtaagacggg cgggggctgg ggcctggagc cagggccagg ccaagcacag gcgagaggga 8160 
gattgacctg gacctgtcat tctgggacac tgtcttgcat cagaacccgg aggagggctt 
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gttaaaacac cggcagctgg gccccacccc cagagcggtg attcaggagc tccagggcgg 8280 

ggctgaagac ttgggtttct aacaagcacc ccagtggtcc ggtgctgctg ctgggtccat 8340 

gcgtagaaag ccctgnaaac tggagggagc cctttgtccc cctgncttca gtttcctcat 8400 

ctgtagaatg gaacggtcca tctgggtgat ttccaggatg acagtagtga cagtaagggc 8460 

agcctctgtg acactgacca cagtacaggc caggcctctt tttttctttt tttttttgag 8520 

atggagtctc actctgtcgc ccaggctgga gtgcagtggt gtgatctcag ctcactacaa 8580

cctctgcctc ctgggctcaa gtgattctcc tgcctcagcc tcctgagtag ctgggattac 8640 

aggtgcctgc cactgtgctt ggctaatgtt tgtatttttg gtagagatgg ggtttcaccg 8700 

tcttggccag gctggtcgca aactcctgac ctcaggtgat ccacctgcct cagcctccca 8760 

aagtgctggg attacaggca tgagccacca cgcccggtca ggccaggcct cttttgaaca 8820 
ctttgcacac catgggtctt ttcatccagg ggggtaggta cagttgtaca gttgaggaca 
ctgaagccca gagaggctca gggacttgcc cagggtcaca cagcaggatg tggcaggtgt

ggggctgggc ctggcagcgt ggctccagct ttccagcata gaaatctgtg aaagcagata 9000 

gtttgtcggt cggtagggga gactttctga gacccgcccc agcggctcag agggtagtag 9060 

ccaggggcct tcctgggggc tcataaccca gaacactgaa tgggaaaacc ctgatggagg 9120 
aggcgcagtg gagctgtggg tgccgatggg aagtcccaga ggagctggga ggtcagtagc 
ggtgctgccc tctgtggagc acttagtggg caccaggtgt gtttccaggt tcatggccct 

gggacctgaa gctcagaagg tgaagtaact tgcccagggc acccgtcggg cagcggcggg 9300 

cagaggattt gtgggctgtg gagcctgtgc tcgtggccca gccctggggg ttgtgagtgt 9360

gctggccggg gagcttttcc tgcaagtgga ctggtgtcta ggagccagca tgtcaggcag 9420 

caggcagcgg gagtgcagca ggcagcggga gcacagcagg cagagggcgg ggctcgagca 9480 

gccatccgtg gaccctgggg cacggaggca tgtgggagag ggctgctcca tggcagtggc 9540

tgaagggctg ggttgtgccc cgaggagggt ggatgagggt aagaagtggg gtccccaggg 9600 

gctttagcaa gaggaggccc aggaactggt tgccagctac agtgaaggga acacggccct 9660 
gaggtcagga gcttggtcaa gtcactgtct acatgggcct cggtgtcctc atctgtgaaa 
aaggaaggga tggggaagct gactccaagg cccctcctag ccctggtttc atgagtctga 

ggatcccagg gacatgggct tggcagtctg acctgtgagg tcgtggggtc cagggagggg 9840 
caccgagctg gaagcgggag gcagaggggc tggccggctg ggtcagacac agctgaagca 
gaggctgtga cttggggcct cagaaccttc acccctgagc tgccacccca ggatctgggt 
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tccctccttg 
gggggcccca gggaacaagt tacaacgtcc ttgcataggg gagcccttca
gctatgtgca gaaggttctg ctctgcccct tcctccctct aggtgctcag ctcctccagc
ccactagtca gatgtgaggc tgccccagac cctgggcagg gtcatttctg tccactgacc
tttgggatgg gagatgagct cttggcccct gagagtccaa gggctggtgt ggtgaaaccc
gcacagggtg gaagtgggca tccctgtccc aggggagccc ccagggactc tggtcactgg
gcttgccgct ggcatgctca gtcctccagc acttactgac accagcatct actgacacca
acatttacaa acaccgacat tgaccgacac cgacatttac cgacactgac atttaccaac
actgtttacc aacactgaca tctactgaca ctggcatcta ccaacactga catttaccga
cactgacatt taccaacact atttaccaac actgacatct actgacattg gcatctacca
acaccaacat ttaccgacac caacatttac caacactgaa atttaccgac accgacattt
accgacaccg tttaccaaca ccgacgttta ccgacaccga catttaccga cactgatatt
taccaacact gacatctact gacgctggca tctactgaca ccgatgccag catctaccaa
caccgacatt taccaacact gacatttacc aacactgaca tttaccgaca ttgacattta
ctgacactga catctactga cactggcatc tactgacact gacgtttacc gacactagca
tctactgaca ctgacattta ccaacaccag catctaccaa caccgacatt taccaacact
gacatttact gacactgata tctactgaca ctggcatcta ctgacaccaa catttaccaa
caccagcatc taccaacacc gacatttacc aacaccagca tttaccaaca ccgatgttta
ccaacgccga cgtttaccga cgccagcatc taccaacact gacatttacc gacaccgaca
tttaccgaca ctgacattta ctgacactga catctactga tactggcatc taccgacact
gatatttacc aacgccagca tctactgaca ctgatgttta ccaacaccga catttacgag
caccgacatt tactgacacc aatatttact gacatcaaca tttagccatg tgatgggggc
cggcttgggg gcaggccttg ctcttggcac tggggatgct gcagagacca gacagactca
tggggtcatg gacttctgct tcttctccag cctcatgtac tggacagact ggggagagaa
ccctaaaatc gagtgtgcca acttggatgg gcaggagcgg cgtgtgctgg tcaatgcctc
cctcgggtgg cccaacggcc tggccctgga cctgcaggag gggaagctct actggggaga
cgccaagaca gacaagatcg aggtgaggct cctgtggaca tgtttgatcc aggaggccag
gcccagccac cccctgcagc cagatgtacg tattggcgag gcaccgatgg gtgcctgtgc
tctgctattt ggccacatgg aatgcttgag aaaatagtta caatactttc tgacaaaaac
gccttgagag ggtagcgcta cacctgtcct tgtggttacg taagatgtta tcattcggcc
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aggtgcctgt 
agacacagct acttggagac tgaggtggga ggatcgctgg agtccaagag
tttgaggcca gcccgggcaa aggggacaca ggaatcctct gcactgcttt tgccacttac
tgtgagattt aaattatttc acaatacaaa attaagacaa aaagttaatc acatatccac
tgccctgctt aagacagaaa acatgggtgt tgttgaagcc agaggcagct gctggcctga
gtttggtgat tggttcctaa gcagttgaag gcagttttgt ttttccatag atgtctgttc
tccctttgct gggtgcagcc tcgccctgct gctgtggtcg ggtttcagtg gcctcgtccc
gtggacgcag cctcgccctg ccgctgtggt cgggtttcag tggcctcgtc ccgtggacgc
agcctcgccc tgccgctgtg gtcgggtttc agtggcctcg tcccgtggac gcagcctcgc
cctgccgctg tggtcgggtt tcagtggcct cgtcccgtgg acgcagcctc gccctgccgc
tgtggtcggg tttcagtggc ctcgtcctgt ggacgcagcc tcgccctgcc gctgtggtcg
ggtttcagtg gcctcgtccc atgggcgtgc tttggcagct ttttgctcac ctgtggagcc
tctcttgagc ttttftgttt gttgtttgtt tttgtttgat tttgtttgat tgtttgtttt
tgttgtcgtt gttgttgccc aggctggagt gcagtggcgc gatctcagct cactgaaacc
tctgcctcct tgggttcatg ccattctcct gcctcagcct cccacatagc tgggattaca
agtgcccgcc accacgcctg gctaaatttt gtatttttag tagacagggg gtttcaccat
gttggtcagg ctggtctgga actcctggtc tcacatgatc cacctgcctc ggcctcccaa
agtgttggga ttacaggcgt gagccaccgc gcccagccct ctgttgagca tattttgagg
ttctcttggt gccagtgata tgtacatgtg tccccatcgc accatcgtca cccattgagg
tgacattggt gcctctcctc ggggtggatg cctccctctg tttccagcaa cttctgaagg
attttcctga gctgcatcag tccttgttga cgtcaccatc ggggtcacct ttgctctcct
cagggctccc aggggaggcc cgaatcaggc agcttgcagg gcagggcagg atggagaaca
cgagtgtgtg tctgtgttgc aggatttcag accctgcttc tgagcgggag gagtttcagc
accttcaggg tggggaaccc agggatgggg gaggctgagt ggacgccctt cccacgaaaa
ccctaggagc tgcaggtgtg gccatttcct gctggagctc cttgtaaatg ttttgttttt
ggcaaggccc atgtttgcgg gccgctgagg atgatttgcc ttcacgcatc cccgctaccc
gtgggagcag gtcagggact cgcgtgtctg tggcacacca ggcctgtgac aggcgttgtt
ccatgtactg tctcagcagt ggttttcttg agacagggtc tcgctcgctc acccaggcga
gagtgcagtg gcgcaatcac ggctcgctgt agcctcaatc tccctgggct caggtgatcc
tcctgcctca ccctctgagt agctgggact acagacacat accaccacac ccagctagtt
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13740 
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tttgtgtatt ttttgtgggg ggagatgggg tttcgctgtg gtgcccaagc tgatctcaaa 13500 
ctcctgaggc acaagcgatc cacctgcctc ggcctcccaa agtgctggga tgacaggcat 13560 
cagccgtcac acgcagctca atgattttat tgtggtaaaa taaacatagc acaaaattga 
tgattttaac cattttaaag tgaacagttc aggctgggcg tggtggctta tgcttgtaat 
cccagtactt tgagaggctg aggtgggcag atcacctgag gtcaggagtt tgagaccagc
ctggccaaca tgatgaaatc cagtctctac taaaaataca aaaattagcc gggcatggtg 
gcaggtgcct gtaatcccag ctactcggga ggctgaggca ggagaatcgc ttgagcccgg 
gaggtggagg ttgcagtgat ctgagatcat gccactgcac tccaatctgt gtgacagagc 
aagactctgt cttgaaaaat aaataaataa aaaaaatttt aaaaagtgaa caattcaggg 13980 
catttagtat gaggacaatg tggtgcaggt atctctgcta ctatctactt ctagaacact 14040 
ttcttctgcc ctgaaggaaa ccccatgccc accggcactc acgcccattc tcccctctct 
cccagcctct gtcaaccact aatctacttt ctgtctctgg gggttcactt cttctggacg
ttttgtgtga ctggaatcct gcaatatgtg gtccctgcgt gtggcttctt tccatagcat 14220
tgtgttttcc agattcaccc acacattgtc gcacgttatc agaatctcat tcctgactgg 14280 
gtgcagtggg ttaggcctgt aatcctaaca ttctgggagg ccaaggcggg acgatcactt 
gaggcaggag tttgagacca gcctggccag cctagcaaga ccccagctac caaaaaattt 
taaaagttaa ctgaacgtgg tggtggtggg cacttgtggt tcccagctac ctgggaggct 
gaggttggag gatcgcttaa gcccaggagg tcaaggctgc agtgagctat gatcgcacca 
ctgcactcca gcctggacaa cagagcaaga ccctgtctga aaaaaaaaac aaaaaaaaaa 
gttcctttct ttttgtggct ggatgacatc ccattgtatg gccacagcac attttgtttg 
tctgtttatc gggtggtggg cagtggtttc caccttttgt ctcctgtgaa taatgctgct 
gtgaacattt gaattcaagt ttttgtttga acacctgttg tgaattattt ggatatatgt 
gtaggggtag gattgctgag tcctatggta atgttaggtt tgacttactg aggaaccatt 
aaactgtttt caacagtggc tgcgccgttc tgcatcccca ccggcagtgt gtgagggttc 
tgactttacc tcctcacaaa cgcttctttt ccatttaaaa aaatattcag ccaggtgctc 
tggctcacgc ctgtaatccc agcactttgg gaggccgtgg cgggcggatc acctgaggtc 15000 
aggagttcga gacgagcctg gccaacatgg tgtaacccca tctctaccaa aaatataaaa 
attagccggg tgtggcagcg ggcgcctgta atcccagcta cttgggaggc tgaggcagga 
gaatcacttg aacccgggag gcagaggttg cagtgagcca agatcgcgcc actacactcc 
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agcctgggtg 


acaagagtga aactccatct aaaataaaac aaaaataaaa ataaataaaa 15240
atttattaaa acattcatca cagccagcct agtgggtgtc ccatgtggct ttgcctcgca 15300
tttccctgat aactaggatg ctgagcgtct tgtcccaggc ttgccacacc tcagcacttt 15360
gagatacgtc gcacagtccc catttgcgaa cgagaaatga ggtttaggga acagcagctg 15420
tgtcatgtca cacagcgagc agggggtctc tgagccgtct gaccccacag ccgaccaagc 15480
tccaatcctt accgcctcct agtgttgtgg atgtagccca gggtgctccc acatttttca 15540
gatgagaaca ccgaagctca aaacaggagc gttttgtcca cattggatac acgatgtctg 15600
tggtttggtc ctgaagtcac tttatatctc agtggtccag actggagtag gacagggggt 15660
tctggggaat ggggaaggtg tctcaggtga aaggaaggaa ttccagattc tccatactgt 15720
ccttgggaag ttagaagact cagagggtct ggcaaagtca gacaaagcaa gagaaatgca 15780
gtcaggagga agcggagctg tccaggaaca ggggggtcgc aggagctcac ccccaggaac 15840
tacacttgct ggggccttcg tgtcacaatg acgtgagcac tgcgtgttga ttacccactt 15900
tttttttttt tttgaggtgg agtctcgctc tcttgcccag tctggagtgc agtggcacga 15960
tctcggctca ctgcaagctc tgcctcccgg gttcatgcca ttctcctgcc tcagcctccc 16020
gcgtagctgg gactacaggc gcctgccacc gcgcccggct aatttttgta tttttagtag 16080
agatgggatt tcactacatt agccaggatg gtctcgatct cctgacctca tgatccgccc 16140
gtctcggcct cccaaagtgc tgggattaca ggcgtgagcc accgcgcccg gcccgatttc 16200
ccactttaag aatctgtctg tacatcctca aagccctata cacagtgctg ggttgctata 16260
gggaatatga ggcttacagg ccatggtgct ggacacacag aagggacgga ggtcaggagg 16320
tagaagggcg gagagaggga acaggcggag gtcacatcct tggctttcaa aatgggccag 16380
ggagagacac cctctgagca tggtaggaca ggaaagcaag attggaacac attgagagca 16440
accgaggtgg ctgggcgtgg tggcttacgc ctgtaatccc aacactttgg aaagctgagg 16500
tgggtggatt gcttgaggcc aggagttcaa gaccagcctg gccaacatgg tgagaccccg 16560
tctctactaa atatacaaaa attagccagg cgtgatggtg catacctgta atcccagctg 16620
cttgggaggc tgaggcagga gaattgctta aacctgggag gcggaggttg cagtgagccg 16680
agatcccgcc actgcactcc agcctgggcc acagagtgag actccatctc aaaaaaaaaa 16740
aaaaaaaaga taaaaagacc aaccgaggaa ttgaagtggg ggggcgtcac agtagcagaa 16800
gggggatcgt ggagcaggcc accctgtggt catgcactgg aagctcatta cctgacgatt 16860
tggagctcat cactgggggc ctaaggagaa tagatactga aggatgagga gtgatggcgc 16920
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ggggcacggg 


tgtctttggt ggccagaact tggggactgc tggggtgcct cactgcaggc 16980
cttctcagcg ccctttatat gcttacacag gctgtttcta agagggggat acattgcata 17040
agcgttttca gactacctca tcatgggtcc ctttctttac cctctgtggc cctggtggcg 17100
cactctctgg gaaggtgcag gtggatgccc agacccgccc tgccatccac ctgcacgtcc 17160
agagctgact tagcctcgag attgctgctg gcacctcctg ccccgggaca cctcggatgt 17220
gcccgtggag atgctggctc tgtgttttct gctggagttt ggtgcgtctt ttcctcctgc 17280
aagtggccac cgctcttggg tatgtcctca ggcttctgcg agtcatggct gcttctcagg 17340
tccttgccca gcgccaggag caaaccctcc tggcactttg ttcaggggtg gatgcgccag 17400
tgttcctgct gtggaccgcc atctcacatg agggtcttgg gcctgcaggc tcgttcagga 17460
aacacccgct gagtatgcag tgtgtgccag ctgtgtccca ggcaatggcg gggacagtgg 17520
ctgctgctgg ggttgtggtg gcttctgggg actctgggga cagctgaggt gcaaggagcc 17580
acggctcctt gaggatgcag ttggactcca ggtggaaggg atggttgggg gaggtataaa 17640
tggggtcagg gaggagacac atttggaaca atgggaacat ttttaagatg ctatgtcggg 17700
aggcaacaag gtggccaacc caggtgctga ggagcccaca ccagccctgg acgtgttttg 17760
ccgctcacct ttgctgggga gtggtgggag agaggattcc gttccacgtg gtggtgtgcg 17820
cagctgggct gtgtggagct gggcgctagg aggaaggtgc tttctgcggg gctagccggg 17880
ctctgccttt gaacacaatc aggctccagg ttttcagcat ccagtgcatg agaggacttc 17940
acgggcagct gtggctgatc ccttgatgaa ttgggagaag aacaaaggtc tatgaaatga 18000
ggtttcatgt agatggcatt agagacgccc acaacagatt tacagagtgg agcggagacg 18060
gcggatgggt ctgggaggcc cctcctgctg gccttgactg tgacagctgt cctgggaatc 18120
agcttccagg ccgccccagc agcctgactg acacacacag gggttttagc cccatcctgc 18180
gaccagctgt tgccatcatc agtgacagct gggagtggcg gtggttccag ccctgggcac 18240
cctccccacc tgctggggcc cacccagggc agtcctgaca cctacaggtt gcttggagcc 18300
gcatccgagt cctgccccac cacgtgtgaa gcccgagtgg tcgtgggctg aggtcccctg 18360
attgcatccc cacttccctt ctgcttcaca tagctgcctc ttctcaccgt ttttccagcc 18420
tcctgggcta ggaattccag tgttgtgctg gctttgcccc aggacacctc cttagccctc 18480
ttcctgagtc tagagccccg ggggttggaa gtcctggccc ctgggacacc tgcagccaca 18540
ctcagcttct cctgtgagcc tccagcatgt cccctcagga ccaagccctc acgttcttgc 18600
ctccccgccc acctgggctc agccagggga aggcctggct gggagcgtct cccctctgcc 18660
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ctgcccttct 
cccctcctac cctgcccttc tctcctctgc cccgccatgg cttttatatc 18720
ctgtgccaca agacatggct gtgtgtgaaa gtggcagggt ctggcatctc tgtgggtctc 18780
tgaggcccac gctccagtgc cactcttccc acccgctggc cgtgccctca tgctggaggg 18840
acagcccagc cctctcccga accccagccc catgtgccca gctgcccccg gccctctccc 18900
ctggaagccg gggtcactcc agccgtatgc catggtgggg acatcctgct tccttggcct 18960
tccagggaag gtcctctttc caaatggcga cacctggtcc ctgcctggag gctggaagct 19020
gtggcccttg tatgcccctc cagggtctgt gcgctcggtt ggcccgagtt cccatcaccg 19080
tcatcatcac catcatcatt gtcatttcgc ttgtctgtga gccggcctgg tctcccagag 19140
cagagaccct ctgaggtcca gcctgagttg gggtctccgt gctgacccct gacggggact 19200
caggacgtac caggtctggg tcaggagtga cccccaaacc tcgtgccctt tgacaggcac 19260
ccctgacttt tgctaagtgg gtggaggtga catcacttac agcgggagtg atgggacagg 19320
gtctgttggc tgcactgtgc tcccagggat ctggggagag gctatatccc tgggctttgg 19380
cactgcagag ctgtgtgtgt ttgtgtgtgt gtgtgtgtgt gtgtgtgtgt gtgtgtgtgt 19440
gtgtgtgttt gcgtgcgcgc acatgtgtat aagatctttt tttattacat gaagcaagat 19500
aactgttgct gtttcctttt gggttttgtg ttcaacagag tggggtactt cttccctcag 19560
acaacagaac tctccccttt aaacacgtgc tgtcagaggg tgggtcttgg gctcatgtct 19620
gtttgcacag ccgagtcaga ggaaacacag ggttcttcat aaaaacactg cacagcaggc 19680
gactgtccag agtcagcctg caggacggca gcagccctgc ccctcagagc acagctaggg 19740
tgggctgctt tgggatctcc cgtcattccc tcccagctgg cagccggcgg ccggcccatt 19800
ccttggtgtg ctggtcaggg gggcgtgcgc ctgctctgct caccctggga atgggacaga 19860
agctggcagc tcggagagga cagggctgga cccttgggtg gcctctggct ggaccatctc 19920
attgtcctca gacacagcct ctcgggtcta gtttcatttc ctgaaaaaca agtgcacaga 19980
actagagcag gagtcgagag ctacggcccc cgggccagat ccagccctgc cacctgtttt 20040
cacaccatgc tcaagctgag tgggttttac attttttaat tacttgaaaa aaaaaaagcc 20100
aaaggaggtt tcatgaccca tgaaaattat atggaattca aaaaaaaaaa attatatgga 20160
attcaaattt cagtgtccat aaataatttc ttgagacagg gtctcgctct gtcacccagg 20220
ctggagtgca gtgctatggc atggctcgct gtacccttga cctcccaggc tcaagcgatc 20280
ctcctgtctc agcctcctga gtagctggga ctacgggtgt gtgccaccaa gcccggctaa 20340
ttttttttta attttagtaa agacagggtc tttctatgtt gcccaggctt ttctggaact 20400
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ccatcttggc 
ctcccaaagt gctgggatta caggctcgag ccacggagcc cagcctgttt 20460
ttgttttttc actgataaag ttttgccggg tgtggtagtg tgtgcctcta gcgatttggg 20520
aggctgaggt gggaggatcg cttaagccca ggagtttgag gctgggctca agtgatcagg 20580
aggtgaacta tgatcatgtc attgcattcc agcctgggtg acagagcaag aacctatctc 20640
ttaaaaatat atatttaaaa agtattgggt gtggtggctc acgcctgtgg tcccagctac 20700
ttaggcatct gaggtgggag gatggcttga gcccaggagt ttgaggttgc agcgagccaa 20760
gatcgtgtca ctacactcta gcctgggtga cagagcccag accctgcctc tttaaaaaaa 20820
aaaaccaaaa aacatgtatt ggaacaeagc catgcctgtt cagtcacgtg ctctccatgc 20880
tgctttctgc tccagagacc cttatggcct gaaagctgaa aatattttct atcctttaca 20940
aaaaagtttg ctgacctctg tcctggaaaa ttcatctccc aagttctctt ccggcactgg 21000
cgttcctggg tgtcctaaat ttggcccctg ttatttctga actctgtttt ggctctgttc 21060
cctcccagga gccaggacag gcacgttctc tgcatcttgt cccctgacgc ccagaggctt 21120
ggctcggctc aggcattctt ggaaatatct ggctccagga aaggcagagg cctcctgagt 21180
cggcccagag ggaacctgcc ccaggtctgg gggaggcctg acccagcaga gtggcttttg 21240
ccgatgggtt gggccggtca agatgtgctg aaagttgtcc tcagaaggcc actttgggat 21300
tccttcctcc agtattagag caactgagag ctgctcattg caagcctgat gttttcccag 21360
ttggccgggt ccaccgggtg ccctgggatt ctgggatctg ggtggaaagt agggggcttg 21420
ggggagtgtc ctgggttctg gaatccaggt ggcaagtggt gaggttcagg gagtggcttc 21480
tgagccacca taggggtctc tgtgggaggc tctgcccatc caggagattc cgcaggccct 21540
gccggcccag agccagcgtc ttgcgcttgc cgaggctaca gccagcccca gccgggtgga 21600
acagcccgtc gcctcctctc actttgtttt ggggccacct gggagtgtgg agcaagggta 21660
gagagggagg aagtggctgc cggccgctgc ccagcaccct tgtttgcctt gggccctctg 21720
tgggctcctt tttattgctc ttcaatgaag ccagggaaat ggacttcctt gcctcacttc 21780
agttcaacat gtctggaagt ttggtattaa aattaagaaa gtgtggaaat agagcaagaa 21840
gagaaaaatc tctccaagag ataatagtga cctctgagct gggcgcggtg gctcacgcct 21900
gtaaatccca gtactttggg aggctgaggc gggcagatca cctgaggtcg ggagtttgtg 21960
accggcctga ccaagatgga gaaaccccgt ctctactaaa aataaataaa taaataaata 22020
aataaataca aaattagcca ggcatggtgg cgcctgccta taatcccagc taaggcagga 22080
gaatcgcttg aacctgggag gcaaaggttg cagtgagcca agatcacgcc attgcactct 22140
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22200 

agtctgggca acaagagtga aactccgtct caaaaaaaat aaataaataa aaaataaaaa
tagtgacctc tggccaggtg tggcagctca tacccgtaat cccagcactt tggaaggaag 22260
gccgagatgg gcagattgct ttagcacagg agtttgagac cagcctggcc aacatggtgg 22320
aaccccatct ctacaaaaat agaataaaat ttaagaggta atagtgacct tttggtagat 22380
cgaaacctgg attgctttct ttttctaaat gctgattctt ttctttgtgg tgtttgtgtt 22440
ctgtgccgat gtccctcccc cagccctgtt attgtgagtg gaagaagggg aaagggttcg 22500
cccgctactg tgagcccctc ctctcacgct gggtgtcctt ggagaagcct gcacttcttc 22560
attgtacgcc agggctgggt ccctccctgg agtggttctg tgctgctggg atggggccaa 22620
cccctcagat gttttctgag tgtcacacac aggtgtgtgc attcatggcc tttgcgtgtc 22680
ttcctgttgt ggaggcaaaa atgtgaagaa ccctagatga ttttgggacc agggctccat 22740
cacctgctgt tcattgcaca ccggagcatc caggcatggg tggagagctc agacttccag 22800
gcacggtcgc aggggctggt ctaaccatgt tcccgcccgc ctgctcgtca gaaccgcctg 22860
ttgggagctg ttatcatgat accatacctg ggccctgggc tatccgattc tgacttaatt 22920
gctccaggtt ggggccaggc cgttgtttgc tgttttgttg tttcttctgt gacgttagcc 22980
actgggctaa tctgagcccc tcagttacag gtggagaaac tgagacccat gggggtgcaa 23040
ggacttgccg aggacccaga gccccttggg ggcagagctg aggcggggcc tggctttggg 23100
tcccagagct tccagtcccc ttcccgctct cctaacagct tttttttttg agacaagatc 23160
tcaccctgtc acccaggctg gagtgcaatg gcatgatctc ggctcactgc aatcttcgct 23220
agctgcgttc cagcgattct cctgcctcag cctcccgagc agctgggatt acaggtgtgt 23280
gccgccatgc ccagctcgtt tttttttgta cttttagtag agatagggtt tcaccatgtt 23340
ggccaggctg atctcgaact cctgacctca aatgatccgc ctgcctcggc ctcccaaagt 23400
gctaggatta caggctggga tcacactgtg cctggcccta gcagctttgt cctgtgccat 23460
ccaacaacag atgaccgaag tctttgtttc ttaacatgca ttccatctgc cttacagttt 23520
tgccacctgc aaaacagagg acttgtcgct tttctggtaa gctggaaatg taatctggta 23580
gcaggaggcc tgtggaagct tgcctttaat ggccttgtgt ctctttcatc ctgtcctgag 23640
agccggagaa cttggatgtt gcacctaact caaccttcct gttaacatac agttctgcag 23700
gctcatggat catcagaacc acgtcctatc tcacgcggct gtatgcttcc gttggttcag 23760
gtgtttttac cttgacagta ttttctcctc ggtggctttt gcggtggttg cttttaatca 23820
gcattgactc ttcaagaaaa atatttagct gctacatctc agaggagaca gggtggaaag 23880
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23940 
catctgagac ctgcaggctc agacttagaa ccagaagtgc cctcagagtt catccggccc
tgacccagcg ggaaatgagt tcacagagaa gcgggagaac tttgccccag gccctgccgt 24000
tgctcataac tgccccaggt ccttacattt gctccaggtc ctgccccagg ccctgcagtt 24060
gctcataact gccccaggtc cttatatttg ctccaggtcc tgccccaggt cctgcagttg 24120
ctctgtgtgg tgggtgtgat ctggagccct ccgcccattg ctgcacctgg ggcaggcatt 24180
gctaattgat cccaggactc cttcctgcgg agcaegccct ggttctccag gcagccgctg 24240
cctgtcagcc tgcagtggtt cgggagagga cacctgcttg cctggtctgt tccaaatctt 24300
gcttctcatc ccagcacagg tagggggtgc tatgggaaag ggatcctcag ttggccctgt 24360
cactgctcta tcagctgggg acgtggcatc ctagtgaaaa catcatggcc gggcgcggtg 24420
gctcacgcct ggaatcccag cactttggga ggctgaggag ggtggatcac ttgaggtcag 24480
aagttcgaga ccagcctggt caacatggtg aaacccatct ctactaaaaa tacaaaaatt 24540
cgccaggtgf ggtggcgggt acctgtaatc cgagctactc gggaggctga ggcaggagaa 24600
tcgcttgaac ctgggaggtg gagcttgcag tgagccgaga tcttgccact gcactccagc 24660
ctgggcaaca gagtgagacg ctgtctcaaa atctcaaaca aacaaacaaa caaaaaacaa 24720
acaaacaaag cgtcatttat ccagcacccc tggggaacca tgctacctgg tgttttatgg 24780
tacctggcaa ggtgcaggtg aagttgctgc tcttgggcat tgaacccgtc ttgtttgggg 24840
cagctcaggc cccaggcagg gtccgggttg gctctcgttg gtgtggccct ggcccatcca 24900
gacctatatt tctgccgtcc tgcaggtgat caatgttgat gggacgaaga ggcggaccct 24960
cctggaggac aagctcccgc acattttcgg gttcacgctg ctgggggact tcatctactg 25020
gactgactgg cagcgccgca gcatcgagcg ggtgcacaag gtcaaggcca gccgggacgt 25080
catcattgac cagctgcccg acctgatggg gctcaaagct gtgaatgtgg ccaaggtcgt 25140
cggtgagtcc ggggggtccc aagccatggc tcagccatgc agacttgcat gaggaggaag 25200
tgacgggtcc atgcctgggc ataagtgttg agctcaggtg ccccgacctg gggaagggca 25260
ggacaggaaa ggtgacagta tctggccaag gacagatggg aagggaccaa gggagctgat 25320
tagggagtgg ttatggacta ggaatgtcgg taacaatggt tagaaagtga ctaacatttg 25380
ttgagcacct gctgtgtgcc cggccctggc cgggagcctt cgtgcccaca gtgaccccgt 25440
ctgcaaatgt agttccttgc cctactcgca ctggggagca ggacgcagag ccgtgcaact 25500
cacaggtgcc aagctcagga ctccctcctg ggtctgcctg ggctgggctg tgcttgttgc 25560
ccctgtggcc cacgcatgtg caccttccac ctgaaagcca ggatcttcag gacgctcccc 25620

gaggaggtcg ttgtctggca caatgatttg tctcttcctg aaaaggtgac agagttacac 25680
tggagagagc agcatccagg tgcggcaggg acaggcctgg ggctcgcggg cagggactct 25740
gtgtcctgcc ggggtcccac actgcacctg cttgtcagag gcactcagtc aatctttgct 25800
gatgaaggat gagaggacag aggacgtgat gcttgctgct gcattgcctg cagtcctggg 25860
tgagatgccc gggttgactc tgctgcccgt cgggtggatg tgatgtcaga tccccggctt 25920
taaaatacga gggagctggg aattgaggga gcaggttggg gcagaaagca cagccccgtg 25980
gaagcctgga gctgaggcag tgtgggcgac ccctggagca gtgagtgctt ccttcatggc 26040
cttcatcgca ccctgcagtc ctcatgtagg ggatgccatc catgaattta gttttcccag 26100 
cctcctttaa aaacgcgttc atgctggggc cggggcagtg cagtggctca catctgaaat 26160
cccaccactt tgggaggccg aggcgggtgg atcatgaggt caggagatcg agaccatcct 26220 
ggctaacaag gtgaaacccc gtctctacta aaaatacaaa aaattagccg ggtgcggtgg 
cgggcgcctg tagtcccagc tactcgggag gctgaggcag gagaatggcg tgaacccggg 
aagcggagct tgcagtgagc cgagattgcg ccactgcagt ccgcagtccg gcctgggcga 
cagagcgaga ctccgtctca aaaaaaaaaa aaaaagtaca aaaaaaaaaa aattagtctg 26460 
ggtgtggtat cacgcgccta taatctcact actcgagagg ctgaggcgga gaattgcttg 
aacccaggag gtagaggttg tagtgagccc gtatcgtacc actgccctcc acctgggcaa 
tagagcgaga ctctgtctca aaaagaaaaa aaaaaaaaga acatttatgc caggtgtggt 26640 
ggctcatgcc tgaaatccca gaactttgga agactgaggc aggaggatca cttgagccca 26700 
gaaatttgag agtgtcttcc ctgggcaaca tagagagacc tcatctctac cagaaaaaaa 
aaaattagcc cggcatggtg gcatatccct gtggtcccag ctacttaggg ggctgacgtg 
gcaggatcac ctgagtctgg aggcagaggt tgaagtgagc tgagatcatg ccactgcact 26880 
ccagcctggg tgacagacag agaccctgtc tcaaaaaaaa aaaaaaaaaa aagcatttac 26940 
tatccaccat ggaaggtgag actgacctgt gagtgattgt tcaaagaaca aaaaataaac 27000 
cccagagata agacaaaagg gtgcctccat gggggtgtga tttaaagctg agaaattggg 27060 
cttcttcccc ctcccctctc accccgtggt ttgctaaagg agatgggaaa aaggattctt 
tttttggctg aaatatttaa cactaaatta aagccaattt taacagcact ttggttgatg
agtgaaatta acagactggc caaaaataaa cgaacggtct gtactatgtg aaaaagaggc 
agctttggcc atgctgggcc aatgtgagtt ttcagggttg ctgggaatgt ctgtgaatcg 27300
gaggaagggc ctagctggga ctctcaggag ccaaggccct gaggggcaac ttgcctggtc 
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cctgecctga ggcgttcact gctttcttcc tgggccagat cacaggcccg gaggctggac 27420 
cactgggctg gcactcttgc cgagctgctc cctgacttcc tgaccatgct cctttcagca 27480 
gccttgctgc actttagttt ccttgaatga aaaatgggga tgagaatagc tcctacctcc 
aaggtgaatg gagtgagttc ggacaggtga ctccctggga ccagtgcctg gcgcctgaca 
aggtccagtc agagcccgca ctgctgttac tgataccctt ggctgtacca ggggagaact 
tggttgccat tgccaggtgt tctcccacca cccccactac tgtccctgtt tgatgtgtgg 
cgggaataaa gctgtgcaca ttggagcttt tggcacatcc tggctttcag gtgaaaggtg 27780 
cgtgtgtgtt tgagggttta gcctggccaa cccagccatg aggtcggacc tgacctgggg 27840 
gtgagtcctg agctcggcac ccctgagctg tgtggctcac ggcagcattc attgtgtggc 27900 
ttgggccgca cccctttccc tgctgggctg ttgatgttta gactggagcc tctgtgttcg 27960 
cttccaggaa ccaacccgtg tgcggacagg aacggggggt gcagccacct gtgcttctgc 28020 
acaccccacg caacccggtg tggctgcccc atcggcctgg agctgctgag tgacatgaag 28080 
acctgcatcg tgcctgaggc cttcttggtc ttcaccagca gagccgccat ccacaggatc 28140
tccctcgaga ccaacaacaa cgacgtggcc atcccgctca cgggcgtcaa ggaggcctca 28200
gccctggact ttgatgtgtc caacaaccac atctactgga cagacgtcag cctgaaggta 28260 
gcgtgggcca gaacgtgcac acaggcagcc tttatgggaa aaccttgcct ctgttcctgc 28320 
ctcaaaggct tcagacactt ttcttaaagc actatcgtat ttattgtaac gcagttcaag 28380 
ctaatcaaat atgagcaagc ctatttaaaa aaaaaaaaga tgattataat gagcaagtcc 28440 
ggtagacaca cataagggct tttgtgaaat gcttgtgtga atgtgaaata tttgttgtcc 28500 
gttgagcttg acttcagaca ccccacccac tcccttgtcg gtgcccgttt gctcagcaga 28560 
ctctttcttc atttatagtg caaatgtaaa catccaggac aaatacagga agactttttt 
tttttttttt tgagacagag tcttactctg ttgcccaggc tggagtaccg tagcgtgagc 
tcagctcact gcaacctccg cctcccaggt tcaagcgatt cttctgcctc agcctcctga 
gtagctggga ctacagacat gcaccaccac acccagctaa ttttttttat atttttagta 
gagacagggt ttcatcatgt tggccaggct ggtcttgaac tcctgacctc aggggaacag 28860 
acggggttgg cctcccaaag ggcggaaata acaggggtga gccaccgttc ccggcctagg 
aaaacttttt gccttctaaa gaagagttta gcaaactagt ctgtgggctg gccttctgat 
tctgtaaaga aagtttgatt ggtggctggg tgcggtggct cacacctgta atcccagcac 
tttgggaggc cgaggtgggc agatcacctg aggtcgggag ttcgagacca gcctcaccaa 
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cgtggagaaa ccccgtctct actaaaaata caaaaaaaaa attaaccggg catggcggcg 29160 

cctgcctgta atcgcagcta ctcaggaggc tgaagcagga gaattgcttg aacctgggag 29220

gcggaggttg tggtgagctg agatggcacc attgcactcc agcctgggca acaaaagtga 29280

aactccgtct cagaaaaaaa aaagtttgat tggtgtaacc aaagcgcatt tgtttatgga 29340
ttgtctgtgg cagcttttgt tctgccgaga tgagttgtga cagatctgta tgggctctaa 
agcctaaaac atgtgccatc cgccccttta cagaaaaagt gtgctgacct ctgttctaaa 
gtattggaca actacaatgt ttgctcattt attattctat gatttgtttt ctgctttttg 

ttgttgttgt tgttgttgag atagggtttc cctctgtcac tcaggctgga gtgcagtggt 29580

gtaatttcag ctcactgcag cctcgacctc ctgggctcta gtgatcctct catctcagcc 29640 

tccctagtag ctgggactac aggcacacac caccactcct ggctgatttt tttttttttt 29700 

tttttttttt gtggagacag ggtttccgca tgttgcccag gctggtttca aactcctagg 29760 

ctcaaacacc cacctcagcc tcccaaagtg ctgggattac aggcgtgagc caccatgccc 29820 

agcctattct actgtttgta ttacatagct ttaaaagatt ttttatgact ttaagtcaca 29880 

agggttcttt gtagaaaaaa atatatatat aggaaagtat aaaaagaaag taaaaattgt 29940 
ccataacctc tccagccaga gacgaccgtt gctgacacct cagcatattg cctttaagtc 
ttttttctct aagatagcat ttctcttcat cacagtcata tgctacgcag aattctgtat 
cctgattttt tcacttgaca ttacaacagg tatttgatgg cgctgtgaca aactctttgg 

cacaatcttt taaatgtatg aaatactcca ctgcacagat gtttgctttt aggcttaact 30180 

gttcttttat tttgcgtgtg ctggttacag ccgggcacag tggctcatgc ctgtaatcac 30240 
aacactttga gagggtgagg caggaggatc acttgagccc agaagtttga gaccggcctg 
ggcaacatag tgagacccca tctctacaaa aaactttttt aataagtcgg gcgtagtggt

gcatagctgt agtcccagcc accaaggagg ctgagttggg aggattgctt gagccccagg 30420 

aggttgatgc tgcagtgacc tgagattact ccactgtact ccaacctgag cgacagagca 30480

agacttgtct ggggaaaaaa aaaaaaaaaa tatatatata tatatatata tatatacata 30540 

tatacataca cgcacacaca cataatataa aaatatatat ttataaatat ataatatata 30600

atataaaaat atatatttat aaataaaatt tataaattat atttataagt aaatatataa 30660 
tatataatat aaaaatatat attatataat atataataaa atatataata taaaaatata 
tatttataaa taatatataa tacatactta taagtatata tttaaaatat atgtaatgta 
tattttttaa tgtatgatat ataatataca tttataaata cacatttata ttattttata 
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taaaatatat ataaaatctc caagttgctt tttccaaaaa ggtgtcttgc tgcatttcaa 30900 
acattcattt aaaaacttga atgctggtga tctggtccag aatgtgttca gtagctgctg 30960 
ccagtggcca agcatctcgg gagatgtcta caaaacacgc tggttctggc ctggcgtggt 31020
ggctcacgcc tgtaatctca gcactttggg aggctgaggc aggtggatca actgaggtct 31080 
ggatttcgag accagccttg ccagcttggt gaaaccccat ctctactaat aatacaaaaa 31140 
aattagccag gcgtggtggc atgtgcctgt aatcccacct acttgggagg ctaaggctgg 
agaatcgctt gaacccaggg ggcagaggtt gcagtgagcc gagatcgcac cattgcactc 
caggctgggc aagaagagcg aaactccgtc tcaaaaaaaa aaaaaaagat gctggttcct 
aaaatgtggc ccttttcctc ctcacctgct gccagaccat cagccgcgcc ttcatgaacg 31380 
ggagctcggt ggagcacgtg gtggagtttg gccttgacta ccccgagggc atggccgttg 31440 
actggatggg caagaacctc tactgggccg acactgggac caacagaatc gaagtggcgc 31500 
ggctggacgg gcagttccgg caagtcctcg tgtggaggga cttggacaac ccgaggtcgc 31560 
tggccctgga tcccaccaag gggtaagtgt ttgcctgtcc cgtgcgtcct tgtgttcacc 
tcgtatgaga cagtgcgggg gtgccaactg ggcaaggtgg caggctgtcc gtgtggccct 
cagtgattag agctgtactg atgtcattag ccttgatggt ggccaggact ggtagggccc 
tcagaggtca tggagttcct tcgtggagcg ggtgctgagg ctgtatcagg cacagtgctg 31800 
gctgctttca cctgggccgt ctcaccgaag tgtccatgga gcctgcgtag ggtgggtatc 31860 
tgtgtcgatt ttacagatgc agaaacaggc tcagagaaac cgagtgactt ccctaaggtc 31920 
acatacccag ttagagcaga gctgggccag gaagtgctgt ctcaggctcc tgaccaggtc 31980 
tccttgcttt gcactcttgc caaaaccatg atccagaact gactttgagg tccccggacc 32040 
tcaggctcct ccgaaatggc ctcttggagg ctgctgagcc acagcttagg acccacctcg 32100 
agaggcaaat gtgctttgag ctgccaggcg tcctgggggc cctgccttgg gcacggggtt 
cagacaggcc ccagatgtgt ggggcgtctt tctggacttg agttttcttt tctgtgtggt 
ggacacagtg ctcacccctt aaagcacctg tgatgtgtgc agcagcccaa tccctgcctg 32280 
tcgcctgttc tgctagggaa ggaaggaata cttcaggatg gcaggacaac agaaagaggt 
ccaggtttta gagcaagggc aggtcaaact tagaaaattc tggaatgagg atgtgcattt 
cctcttctgg atctgctaaa agaagaggga aggaggggct gctgggggag gagcccagag 
ccgagtttac atccggatcc cgcaaggcct cccctgccct gaggtcttgt tttgtgatgt 32520 
gcttgtgtcc atcctggttt ctgccgtgtc cccaacatcc ggccaagctt aggtggatgt 
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tccagcacac actcaccctg tctgtgcacc tgtttttgtg tccgtaagtg ggtatttact 
caccttacga gtgagccact gtgggaattc agggaggtgg cgcagtgacc acccctggag 
ggatatgtgt gtggcagggg tcgagggtct cgcccttccc tgcttcctgc gcgtggcttt 
ctccaggacg gggagggctg agctgaagag gtggggacag ttgcgtcccc ccgccaccca 
ctgtcctgcg gtgagagcag actcactgag cctgcccttc tcccttgtgc cttccagcta 
catctactgg accgagtggg gcggcaagcc gaggatcgtg cgggccttca tggacgggac 
caactgcatg acgctggtgg acaaggtggg ccgggccaac gacctcacca ttgactacgc 
tgaccagcgc ctctactgga ccgacctgga caccaacatg atcgagtcgt ccaacatgct 
gggtgagggc cgggctgggg ccttctggtc atggagggcg gggcagccgg gcgttggcca 
cctcccagcc tcgccgcacg taccctgtgg cctgcaagtt ccccaacctg gcaggagctg 
tggccacacc cacgactgcc cagcagcctc accctctgct gtgggagttg tccccgtcca 33240 
cccctgggtg cctttgctgc agttatgtcg ggagaggctc tggtgacagc tgtttcctgt 33300 
gcacctgctg ggcactaggt cccagctaat ccctgtgcca ggactctaat ttcaccctaa 33360 
cacacatggt ggttttcatt gctggggaag ctgaggcctg agcacatgac ttgccttagg 33420 
tcacatagct ggtgagttca ggatccccca gagataccag ggccagcact cgatccccac 33480 
ccagccctga accccaccat gtgctgggat tgtgctggga gtgtccacac gcctgggacc 33540 
ccagggctgg tgctctcatc tcctttttcc agatcatgag aatgaggctc agggaagttt 33600 
gaaaaaaacc tatcccaagt cacacagcaa caggagcagg atttgaaccc agaaaagggg 33660 
accgcacact ctgttctgct agagtagtta gctgtcctgg gtgatatggc aggtgacagg 
ggcaactgtg cttaacaaag gaacccccat cccccctgcc aagttgggag actagaaggt 
caggggcaga agctctgaag ggccaggtgc agtggctgac acctctaatc ccagcacttt 
gtgaggccaa ggcgggcaga tgatttgagc ccaggagttc aagatcagcc tgggtaatgt 
agtgagacgc catctctaca aaaaaatttt ttaaaaatta gctgggcatg gtggttcatg 
cctgtagtcc aagctacttg ggaggctcag gtgggaggat tgcttgagcc caggaggttg 34020 
aggttgtggt gagctgtgat catgccactg cactccagcc tgggcaatag agtgagaccg 
tctccaaaaa aaaaaaaaga agaagaaaaa gaagctctga ggctccaagt ccccaggcac 
cccttggctt gagggcagac aagggaggag agggtcacct gggcagccct gacttttgtc 
ccctggcaaa gggaccttca gtgaccttgg ccctaggaga gcctctgagc acgtcagcca 
tgtcgaaccg ctcaggaagg gcagcaagaa tttggcttct gacctctgcc tctcctactc 
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gccatctgca ctgggtgtgg ttgtgcccat tttacagatg aggaggctgg ggcatcgacc 
agctgaatgc cttgtcccag gtactgcgta ggcagagctg gcagttgaac cccgtgtcct 
ggttgtcgct gggggtgggc tgcaccctga cttgtgaggc cagtagcaag gtttgcacgt 
gacttcgtga ccgtcaccca gctctgcagc acatcccgtg acccagctca tccaggccgc 
atgcaaacct gttgccaggc gagaaaccag tcaccgcaca gctgtggttg cctgaaatga 
ttaagctcat taatcacccc ggagtgagga cagactcaga tgaaaaccag caaaagccct 
ggaaactcat gtgaccctgc caatgagggc ggccatgtgc attgcagcct ggccgtcact 
cctcggtacg tgttttggac ttaaacgctc cggatgttta ctgagtgctt gattaataac 
atggaaggcc tggtctcatt gctgtgggag tgaaggatgc acagccaggc ctgacatgat 
gagaacaaga acctggagtc tcgctgcctg ggtggtaatc ctggccctgc cacttagcaa 34920 
ctgtgtgact gtagccaggt cacttaattt tgctagatcc tgcctgcgct tcagtggatc 
ttgctggttt tccaaggtgg ccaaacactt taaggcattc atgtggtcgc taggctgcag 
ggttgaaccc tggctcaccc cgcagggcgc cgtgtgctct gtggcctggc tgtgcctttg 
ctgacaccgt gcccgtgtgt gttcatgcag gtcaggagcg ggtcgtgatt gccgacgatc 
tcccgcaccc gttcggtctg acgcagtaca gcgattatat ctactggaca gactggaatc 
tgcacagcat tgagcgggcc gacaagacta gcggccggaa ccgcaccctc atccagggcc 
acctggactt cgtgatggac atcctggtgt tccactcctc ccgccaggat ggcctcaatg 35340 
actgtatgca caacaacggg cagtgtgggc agctgtgcct tgccatcccc ggcggccacc 
gctgcggctg cgcctcacac tacaccctgg accccagcag ccgcaactgc agccgtaagt 
gcctcatggt cccccgcacc tcactccctc gttagatcag gctggttctg ggagctgacg 
ctgaaaggag cttctcatct ggggttcctg ggtgtacata gatggttggg taggttgtgc 
actgcacaag ctgcatgatg ctacctgggg gtccaggtcc aggctggatg gacttgttgc 
ttcatcagga catagataaa tggccaaaac tcctcagctg gaaggtcctg ggcaggatct 
ttgggtgtga aaaccagtca caggggaagg gtgcttgctc atactgccag cacagtgctg 
agtgctttcc atagcgctcg tttactcctc aagcctggag ggtggggagt agcatggtcc 
catttcacgt acaaggaacc cgatgcacag agaggtgtgg caacccatcc aaggccatac 
aactggggtg ggttgagccg gggttgactg tggcaggctg gctcaagagt ccctgctcct 
gaacccttgc caggcagcct ggcatcagct cggggaattt ttgccctgac ccttggaagc 
aagtgggcct ctttgttctc atgtcagtga tgagaagagt gactttccta tggcccctct 
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ggagtacagg tgtttcctgt tggcgggctc ttcccccatg acatcagcag cgagctggtt 
atgattccct acgcagaact tgatagttta taaagctctt tgtcatccag gccccgttgg 
agtctcacgc agacctggtc gcaggcgggg ctggtcttgc ctgtcccagc tgcatggatg 
gggaacttga ggcttgcaaa ggttaagggg ctgttcgagg cccacgctgg caggagatgg 
gcctgggcca gagtctggga cttcccatgc ctgggctgtc tttggtcctg ttgctcacca 
tccctccctg gggccatgac cttagagagc caaatggagg tgcaggtaac ccacggcaag 
gaggggttgc catgactcag agtccccgtc ctgtggccgg cagtacctgg tgcaacgact 
tggatttcag accagccact gtagcccgct gacggtgcgc tcgaagtgcc acagcttctg 
aagccaggca ggactcaggc caggagactc tgttagctgt tgagagggag aggccaacgg 
atgttctggt tctgctagag agctggttct tcggatcctg gtaccagtgc actgagagga 
ggcccagctt gattctgggg ctgccttgtg gtggcatgtg ctgctcactg acaccctcga 
ggagtgtctt ctctcgggct tgttgactgt gcccggtttt ccgcagttca ctggtgcaca
cataggcaca tagcaaaccg cacacacagt cgtgggtatg agtttcacta cattccacca 
ccagtgttca ctaccattac ctgccttccg tcttaagtgt tcatcattta aaaataaatt 
tattgggctg gacgcggtgg ctcatgactg ttatcccagc actttgggag gctgaggcgg 
gcagatcacc tgaggtcagg agttcaagac cagcctggcc aatatggtga aactccatct 
ctactaaaaa tacaaaatta gctgggcatg gtggggcatg cctataatcc cagctactca 
ggaggctgag gcaggagaat ggcgtgaacc cgagaggcag agcttacagt gagcccagat 
agcaccactg cagtccagcg tgggcaacag tgcgagactc catctcaaaa aaaaaataaa 
taaataaaag aaaaataaat ttatgatcta tttcaaaaat aacacatgta ctttgaaaca 
gcagagacac atatgacacg gagaatgaaa ttccccatag cgcaccccca agagacagcc 
ctggtccccc cgtctttccc gtggacctcc agcggggcag atgctgagcc gcctgttgtc 
gagtggcatg ctatcccgtc ctccagctcc tctgtggctt acagacaccc acctgcagcc 37440 
ctgtctttgc ctcctctagc gcccaccacc ttcttgctgt tcagccagaa atctgccatc
agtcggatga tcccggacga ccagcacagc ccggatctca tcctgcccct gcatggactg 
aggaacgtca aagccatcga ctatgaccca ctggacaagt tcatctactg ggtggatggg 
cgccagaaca tcaagcgagc caaggacgac gggacccagg caggtgccct gtgggaaggg 
tgcggggtgt gcttcccaag gcgctcctct tgctggtttc caggctgctg cccctgtcct 
tagcagaggg aggaaacaga ggatggctct gggtgaatga tgacttgggc ttcgattatg
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tagtcacagg gtatgaccct gagatgcgtg gaaccccgag actgtgatta tatgtagaaa 37860
ctgggtttcc ccgttgttta agtagtcatg gtggggtcag accccacagg acttttgtct 37920
tttcaagaaa gaaaatggtc gtgtgtcatg caggggtagt tggtactggt taatccaggt 37980
ttatccttta ttttgtggga actgtacagt catttctgct acaatgctgt atatgctctt 38040
ctgaaagaca cctatgcaaa atcgcacagt aaaaatgaca caactcatag ggaaagcggg 38100
gccagggcac agccctcaaa atctccatca atgacatgta agaaaagaga ggaacctggg 38160
aaatagcaaa gtgccttttg cacattaaat ggttagctat atcccacaat actgtgcatt 38220
cgtaaacgtt aatgctgcaa taaatacggc acttcacctt gggaagatct ggagttggct 38280
tatgagtgtg gaagggtgta gcgcatgagt ttttgtgaaa cactggaagg aggattgtgg 38340
gaaatcaaat ggaaagttct caccccaggc gtggagaaga gtgggtcatg gccccagcag 38400
tgagcccagg gaggtcagag acggaggtgt gtgtgtgggt gtgaccctgc gcagttccct 38460
gccggctgta gttttttgca ttcgcttaat gtttctcgtg gaggaaattg tgcatgagca 38520
aatgtgaaac cgtgctgtgc tcaaattgtc ctaatacatc attgcattgg aacagattgg 38580
ctttnttttt tttttttttt tttttttttt tttgaaatgg agtctcactc tgtcaccagc 38640
ctggagtgca gtggcatgat cttggctcac tgcaaccttt gcctcctatg ttcaagtgat 38700
tttcctgcct cagcctcctg agtaactggg attacagggc atgagccacc gcggccggcc 38760
agatttgcat ttttgaaaca actgctaggc tgggcgcggt ggctcacacc tgtaatccca 38820
gcactgtggg aggccgaggc aggtggatca cctgaggtca ggggttcgag accagcctgg 38880
ccaacatggt gaaaccccgt ctctactgaa tatacaaaaa tcagctgggt gtggtggcgg 38940
gtgcctgtaa tcccagctac tcaggaggct gaggcaggag aattgcttga acccaggagg 39000
cagaggttgc ggtgagccga gatcacacca ttgcactcca gcctgggcaa caagagcaaa 39060
actccatctc aaaaaataaa aaatagaaaa acaagtgctg tagcggaagt gagcactttg 39120
cggagtcagg cttgtgtggc ctgttccaca aatgatgtgc tcacggtggc ctcaggccca 39180
cctggagtct gcagcatggg gcacaacagg ttcattagtg tagaattcca ggacaggcct 39240
ggctcctaag cagccttctt ttacaaaaac tgcagagccc gcctgtatcg tagcactttg 39300
ggaggccgaa gtgggtggat cacgaggtca ggagttcaag accagcctgg ccaacatggt 39360
gaaaccccat ctctactaaa tatacgaaaa ttagctgggt gtggtggcac gcgcctgtag 39420
tcccagctac tcgggaggct gaggcagaat tgcttgaacc tgggaggtgg aggttgcagg 39480
gatctgagac catgtcattg cactccagcc tgggcaacag agcgagacgc catctcaaaa 39540
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I: 



aaaaaaaacc tacagagcca cacggcctct ttctccaccg agtgttggtg tgggagcttg 39600
tgttattgtg gtgaaatctt ggtactttct tgaggcagag agaggctgag cgcctggaga 39660
gactttcaca tgggtcgcca tgtccgccgt cggtttcgct gttgtgctcc ccatctgaag 39720
gctggtgccg tccagacagg ctggacgccc ctttccacca gatccttcct cccgcagcag 39780
tttctagtta cgttgtactg tgaggtctgt gtccttggtt gatggcaaaa gtcagccgaa 39840
ttgaaattca gagccatgcc tggctccctg gagcttctct cctgggcagc tgtgatcatt 39900
gcctctgctg tggtgtgggt ggtggaaatg gattcctttc atcttgcttg ctacaggtga 39960
ctgtcacgtg gagtcctttg gagagaggga cgtgttaatt gatggatgtg gctcccatgc 40020
tgagaaagct cctgggcgta cattgcctta gagtttcatt ggagctgcgt tcttttatgg 40080
tgtctgctag gcagaagtga tgaagacttg gaagaaaacc cagaaggttt tccacttaat 40140
ttggaaaatg tgcttttccc ctcctgtgtc ttttgctaag gtccagcctc ctgcagcctc 40200
cccgctctgt ggactctggc tttgattctt tattaggagt ccccctgctc ccccaaaaga 40260
tggtgtctaa attatcatcc aattggccga ggttttgttt tctattaatt gtttttattt 40320
tttattgtgg taaatttata taacataaaa tttgccattt taattgtttt gttattgttg 40380
tttttgagac agggtctcac cccagtgccc aggctggagt gcagtggtgc gatcatggct 40440
cactgcagcc tcagcctcca gggctccagt gatcctctca cctcagcctc tctagtagcc 40500
gggactacag gcatacacta ccacatctgg ctgatttttt gtattttttt tttattgtag 40560
agacccgcta tgttgcccag gctggtctca actcctggac tcaagccatc ctcccacctc
accctcccaa agtgctggga ttacaggcat gagccacaac acccagccat tttaattttt
tttttttttt ttgagatgga gtctcactct atcgcccagg ctggagtgca gtggcgtggt
atcaactcac tgcaacctct gcctcccagg ttcaagcgac tctcctgcct cagcctcctc
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ccgagtagct gggattacag gtgcccatca ctatgcctgg ctaatttttg tattttttag 40860
cagagacggg gtttcaccat gttggccagg ctggtcttga actcctaacc tggtgatccg
cccgcctcgg cctcccaaaa tgctgagatt acaggtgtga gccaccgtgc ccggcctttt
tttgtttttg agacagggtc ttgccctgtc acccagactg gagtgcaatg gtgggctctt
ggctcactgc agcctccgcc tcccaggctc aagttgtgca cctccacacc tggctaactg
tattttatgt agagacagat ttcaccatgt tgcccaggct gggcttgaaa tggactcaag
cagtccaccc acctcagcct cccaaagtgc tgagattaca ggcgcgagcc accgcaccca
gcccatttta cctattctgc agttgacagt tcagtggcat tcagtcagtt cacgaggtaa
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ccatcactgc cattcatctc cagactactt caccttctcg gcagatgtcc gaaactgtcc 41340
gcattgaaca cactcctcat ctccctctga cagccaccat tctactttgt atctctctct 41400
gccttctcta ggtacctcat gtaagtggaa ttataccaat atttgccctt gtgtgactgg 41460
cttctttcat gtgacatggt gtcctcaagg ttcatctgtg ttatagcctg tgtcagaatt 41520
tccttcctta aagcctgaat aataacccgt tgtaaaggct gggcgcggtg gctcacaccc 41580
tctaatccca gcattttggg agtccgaggt gggcagatca cttgaggtca ggagtttgag 41640
accagcctgg ccaacatagt gaaaccctgg ctctactaaa agtacaaaat tagctgggtg 41700
tggtggcgcg cacctgtaat cccagttact caggaggctg aggcaggaga atcgcttgta 41760
cccgggaggc agaggttgca atgaaccaag attgtgcctc tgcagtccag cctgggtaac 41820
agagtgagac ttcctgtctc aaaaaaaaaa aaaatcatcg gatggatgga cggaccactt 41880
cttgttattt atccatccac gggtgctagg tttcttccac ctttggttgt cgtgaataag 41940
gccactatga acatttcctt ccgtggtgaa ggttttgtac tagtgaggaa aaggcgtgtt 42000
tgtggtgttg cataggattc tggtaagaaa gtttgcacta accataagta tttgtactac 42060
attaaaatga aagctcaggg gccgggcgcg gtggctcacg cctgtaatcc cagcactttg 42120
ggaggccagg gcgggcggat catgaggtca ggagatcaag accatcctgg ccaacatggt 42180
gaaaccccgt ctctactaaa aataccaaaa aactagccag gtgtggtggc gggcacctgt 42240
agtcccagct acttgggagg ctgaggcagg agaatggcgt gaacccggga ggcggagctt 42300
gcggtgagcc gagatcgctt cactgcactc gagcctgggc aacagagcaa gactccgtct 42360
cacgcaaaac tctgtctcac gcaagactcc gtctcaaaaa aaaaaagagt tcagggttta 42420
tgaaactggc cagccgcgta aagtttgctg tgttgttttt gtgcccggga ggagtgtggc 42480
cagggtgtca cgtcacacag tacacgtttc tcagatggtg gttctccaga ctgctgtccc 42540
aaagtctgtt tttgcatctg gttcccacag acccaccctc cacggtgagc ctgattttgg 42600
ccagggtagc tggaatcttg cttgtctttc agcccggcag ctgtaccagt ccagggtcca 42660
cagctagtgg cttttaggaa ggaatttgtt cagttggctt tgacacatgg ccccctaggg 42720
tccacagctc tgtagtgatg tggatgttgt tatctacaaa gacacatgat ccttcgtgtc 42780
cagatgaaag tgatgatgtc tttgcagctg cccagcaagg ctgtgtgtgt gtgtgtgtgt 42840
gtgtgtgtgt gtgtgtgtgg tgtgtgtgtg gtgtgtgtgt gtgtatgggg gagggaggca 42900
ccctttccat ctgggggtgt gtgtgtgtgg ggtgtgtgtg tgtgtgtgcg cgtgtgtgtg 42960
gtgtgtggtg tgtgtgtgtg tatgggggag gcaccctttc catctgggtc caagagactg 43020
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ggcctgggga agacgcttct ttttatctac ttagagactt tgttttattt gtattttttt 43080
gagacagggt ctcactctgt cacccaggct ggggtatggt gatatgagca tagctcactg 43140
cagcctcggc ctcccaggct gaagcgatcc tcccacctca gccttctgaa tagctgggac 43200
tgtaggcgtg cgtcaccata ctgagctatt gttttttttg tttggttggt ttaatttttt 43260
ttgatacaga tggagtcttg ctatgttgcc cagactagtc tcaaactcct gaactcaagt 43320
gattctccca cctcagtttc ccgacattct gggatcacag gtgtgagcca ctgctgtctc 43380
cctgttttat taactgctga aagacctaga taaagaaagt ctgaaaagac ttactatcag 43440
agcaccatcc taagatgatt ccctctgact caatggagag ggaggggagc ttttccttca 43500
ggcctgggtg gcaggagccc aggtgctcca ggccccattt gccccaggcc aaatcactcg 43560
ggaacttgga tgcagctgtc tttcagggta acccaaagga accagatccc cgcaggcagt 43620
aggcttctgg gctgtcctct cctcctacgt cagctcagta agagcccttc gaagggatgc 43680

tgtgtcggag gccccaaaag cccaggctca tccctgagat gcacagggtg ggctgggctt 43740 

aggcagcgct cgagcatctc ctggacggtg accccagaga gtgtggagac ggagagtcct 43800 

tgagagtcac tgagagacgt ggctgccctg ccttcccaag aggggctctg agtcattccc 43860 

cacactcacc tgcccctacc caccctcacc tggcccccag cctcacctac ccccacatct 43920 

gtaccgatcc ctttacccgc accttcccta cccaccctca cctcccctgt accttcacct 43980 

cccccactca cccgcccctg caccctcacc tgtcccccac cttcacctaa cccccaccct 44040 
cacctgccct cccctcacct ggcctccttc cgttggggaa ggggttgtaa ggggcggccc 44100
ccaaactgtc tgtcctggtg ccctgcagag aaaacagtac gtgagggccg cagtccaaaa 44160
gcttgagtcc tggaaggtgg aggagacagg gatgtgttgg gaagggcccc atggtcttgg 44220
atcccttctc gactgtcaat ggggccttca tgggagcgcc agtctagtga tgcacagctg 44280
ggtgcccggc gggtggctga ggaggcctaa agtccgaggc ggcaagagct cttccagagg 44340
ctgttgtcct aatcgctctg gcatactcag gcgggcacgt agttaggagc tgattggaga 44400
ggagagaccc ccacaccaat actgggattt gactttcagg ctaaacttga gaagtgtggc 44460
ctctgctgtc ctgccagagc tctccagcca gtgcccaggg ctctccagcc agtgcccggg 44520
ggtctccacc agtgcccggg ggtctccgcc agtgccaggg gtctccgcca gtgcccaggg 44580
gtctccgcca gtgctcagga gtcttggttt ctttgtctta cagccctttg ttttgacctc 44640
tctgagccaa ggccaaaacc cagacaggca gccccacgac ctcagcatcg acatctacag 44700
ccggacactg ttctggacgt gcgaggccac caataccatc aacgtccaca ggctgagcgg 44760
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ggaagccatg ggggtggtgc tgcgtgggga ccgcgacaag cccagggcca tcgtcgtcaa 44820
cgcggagcga gggtaggagg ccaacgggtg ggtgggggtg ctgcccgtcc aggcgtgccc 44880
gccgtgtctt ctgccgaatg ccagcctctc acaggctggg gagactttcc accctgggga 44940
tccaatgggt ggctttccag ggtcccaaaa gcaaacacag gctctttcac agcccctcca 45000
ggaaagcaga aagccccaag ggctggaagg gaagggggag ctctgctgag aggttacaag 45060
gcagcgctgg ccgacgggag ttgcagttga taggttttgt atcatccttg ttaaacttga 45120
accctgtgca gaaatccctt ccacggcatg ggggctgcct gttgactcgc tcctgttcca 45180
ccacagggag ctcctgggct tcttcctccc agaggccccc gacgctccca cctgttggtc 45240
gtcagagctt ctggttggtg ggaaggcacc caggaccttg aggtctccag agagaaaagc 45300
cagggaaaga gggagaccga aacccatgtg acatgaaact caggctccaa actgagcacg 45360
ggaacgtttg gggacaggag cgcgatggcc ttcctcagat agctgggggg ctggcatgaa 45420
gacgggagct acagccagca caggtcctgg gccgggagcc cagagattga gccctgactc 45480
tgtcacttac tggccacgtg accttgggcg ggtggcatag cctcttggag actcagtttc 45540
ctcattggta ggagtgacgg ccacagtggt gcggcctctg cagcacacgg ggggctcggt 45600
gggcggaagc cccgggtcta taaggcggct gtgcaggagc cagccgagct ggtctcccaa 45660
cagccagggc tccggggtcc ttagcagctg tggggggcct gcacctgttt cccatggctg 45720
ctgtcagaaa ttaccagaag ccaggtggct gagagtaatg gacacttgtt ctctcacagt 45780
tcctgagggc tgaagcccga gatcgaggtg tgggcagggc cctgcgccct ctgaaggctc 45840
tgagggaacc tttgggcttc tggtggctcc aggcacccct tgacttgtgg tcctgtcact 45900
ccagtctctc tgtctggctg cacatggcgt ggcctcttct gtaccattga aggacacttc 45960
agttggattt agggcctacc ctcacccatt gtggtcgtat cttgatcctt catgacattt
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gtaaagaccc tgcttccaaa taagctcaca ttctgaggtt ctggggtgag cgggaatttg 46080
gagagcattg ttcaactagt atagaatgtg acctgtcagc ctcgggcagc cctgagaggc 46140
aggggctttc cacagcccag ctgggtgccc tgggctccgt gctgtccgag gagacgccat 46200
ccccacaccc gtccttcacc cgccaccctc ccgcaggtac ctgtacttca ccaacatgca 46260
ggaccgggca gccaagatcg aacgcgcagc cctggacggc accgagcgcg aggtcctctt 46320
caccaccggc ctcatccgcc ctgtggccct ggtggtggac aacacactgg gcaagctgtt 46380
ctgggtggac gcggacctga agcgcattga gagctgtgac ctgtcaggta cgcgccccgg 46440
ggcctgccct aaccgcagac acccggcctt cattgtcagt aatggcagca gctgccacat 46500
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tgtccgagac ctgccgtgag cccagtgccg cgccaggggc tttgtgtgta gcgtgttttg 46560
tcctcacact gacagctgta ggctggggtt ctgagtgagc cccacagggc agaggcagaa 46620
aatgagtctc agagagggtg agcgagctgc ttggggcccc acagcaggag atggagcagg 46680
actgcagcct agcctctgcc cccagcacct gcgcaagaag ctgctctgct ctggactgtg 46740
ttaggctgcg agggctggag agaaatgaga gttggtgctt agagaggggg cgcaggtccc 46800
catggctttt cctcttatga tgaggtagat gggtgaaggg aggggccatg cttgcagggg 46860
ccagtgaccg aggcccgccg ttggaactga tggccttcat cccgagccca gcccaggtgg 46920
gagcagggct ttccgagggc ttgtcttggg tcggcctgct tccagggact ctgctgcagc 46980
tcccacccct gtccaaagca tggaatcccc caggctccct ggcagtcctg tcaacctctg 47040
tcctcccaag ctgagtgtgg ggcaagttct ggaggtcagc actgctcagg ggggcccacg 47100
ggctgcttgc aggggccaac cgcctgaccc tggaggacgc caacatcgtg cagcctctgg 47160
gcctgaccat ccttggcaag catctctact ggatcgaccg ccagcagcag atgatcgagc 47220
gtgtggagaa gaccaccggg gacaagcgga ctcgcatcca gggccgtgtc gcccacctca 47280
ctggcatcca tgcagtggag gaagtcagcc tggaggagtt ctgtacgtgg gggctggcag 47340
tggggtgggc agggtggcct ctaaacccga cccctggagg aggctggagg ccagtgcaag 47400
atcctgtgtg gcctcagcca ggcggtggtc tctgccagat gccaactgtt gcccgctggg 47460
gttcagcgac atgtccgaat gtcccgaggc ctctgaggtt gttttctttt gccgcagaac 47520
aaatcaccac gaacagcgtt ttaagacaac accaactctt tttttttttt ttttttttga 47580
gtcaggatct tgctctgttg cccaggctgg ggtgccctgg tgcaaacaca gttcactgca 47640
gcctcgacct ctgggcttaa ttaagtgaac accttgcctc agcctcccag gtagctggga 47700
ctacaggtgg gcaccaccac acctggctaa tttttttttg tagagacggg gtttccccat 47760
gttgcccagg ctggtctgca actcctgggc acaagctatc tgcctgctgt ggcctcccaa
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agtgctagga ttataggtgt gagccactgg cctgacaaca cccacggatt gtctctcagt 47880
tctgtaaggc aaagtccagg cacagcgtgg ctcacctggg ttctctgctc agggtctcac 47940
ggggccagaa tcaaggtgtc aggaacgctg ggccctcagc ggaggctctg tggagaaatt 48000
agcttccttg ctcactcagc aggtagcagt tgtgggatcg aggttctgtt ttctctctgg 48060
ttattggtcg gggaccactc tcagctccta gaggccaccc caggtccttg ccccgtggcc 48120
ctctctgcct cagcagtggg ggctccctgc gtcagtccct cccgcacctt gagtctctct
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gatttgcttc taaagggccc tgtgattcgg ctcagccacc tttagattag gttagcctcc 48240



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cctttgatag actccaagtc ggctgattaa taaccttact cacatctgca gaatcccttc 48300
tgccacataa ggtcatgacg ccgtgctggg gactggggtg ggaaattacg gggtcattta 48360
ggattctgcc tgccactgcc ttgctgtgtc ccagggcttg ggggaggggc ctccacagct 48420
gggaccacag tccttcctcc cctccatggt aaccatctga ggattacttg agaccagcct 48480
gggcaacatg gtgagaaccc atccctacaa aaaatacaaa caaaaaggga ccaggctggg 48540
cttggtggct catgcctata atcccagcac tttgggagac caaggtgggc tgatcacttg

aggttgggag ttcgagacca gcctgcccaa catagtgaaa tcccgtctct actaaaaata 48660 

caaaaattag ctgggtgtgg tggcaggcgc ctgtattccc agctactggg gaggctgagg 48720 

tgggagaatt acttgaacct gggaggcgga agttgcagtg agccaaaatt acgccactgc 48780 

actccagcct aggcaataga gtgagactcc gtctcaaaaa aaaaaaaggg ccaggggtgg 48840 

tagtgacaaa gagaccctat cccaaaaaaa ccgaacactg aatccttgag actgagtaag 48900 

gacactgtga aatttttctg ggtggggcag ggaacagagc gtcttctgtc atttcttcca 48960 

cctgggtgtg gtcagctctc cctccaagct gcctcctctt cttctcattg tccgggtgtt 49020 

ggacacattt ggttaactgg atagaataac gcgagttccc agggacttgg tccatttgct 49080 

attttatttt attttatttt attttatttt atttatttat ttatttattt atttatttat 49140 
tgagatggag tttcgttttt gtcgcccagg ctggagtgca gtggcgcgat ctcggttcac 49200
tgcaacctct gcctcccagg ttcaagtgat tctcctacct cagccttcca agtaactggg 49260
attacaggca cccaccacca taccaggcta atttttttgt atttttagta gagacgggtt 49320
ttcgccattt tgcccaggct ggtcttcaac tcctagcctc aggtgatcca cgcacctcgg 49380
cctcccaaag tgctgggatt acaggcatga gccaccacgc ctggcaccat ttgctatttt 49440
aattcccatg tgtattagtg tcccacggct gctgtaacaa atgaccacaa actggatggc 49500
ttaaagcaac agaaatggat tcccccaatg tgctggagac cagaagcctg cgaccaaact 49560
gttgggaggg ctgtgcttcc tctgggggct ccagggagga tctatttgtt ggcccttcca 49620
gtgctgtggg tgccagcgtt ccacacttgt ggatgcgccg cctcaacctc tgcccatctt 49680
catgtgtcca tctcctttgt gtctgcgtct ttacctcttc ttcttgtctg tgttgcctct 49740
tataaggacg tttgtcattg ggtttagggc ccacccaaat catccgagat gacctcgtct 49800
tgagatcctt aacctgcaaa gacccttttt ccaaaaaaag gttatgctca cagattctag 49860
gccttaagac atgggtgtat ctttctgggg ggcactatcc aaccccttat acaatgaaag 49920
acgggaagag ggccaggtgt ggtagttcac gcctgtaatc tcagcacttt aggaagctga 49980
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agcgggagga tcacttgagc ccaggagttt acaagtagct aggcaacatg atgagacccc
atttctacaa aaagtaaaaa aaaaaaaaaa aaaaaaaaag ccaggtgtgg tggctcacac
ctgtaatccc agcactttgg gaggctgagg caggcagatc acgaggtcag gagattgaga 50160
ccatcctggc taacacggtg aaaccccgtc tctactaaaa atacaaaaaa ttatggccgg 50220
gcgcagtggc tcccgcctgt aatcccagca ctttgggagg ccgaggtggg tgaattacaa 50280
ggtcaagaga tcgagaccat cttggctaac acggtgaaac cccatcaaga tcacaaggtc 50340
aagagatgga gaccatcctg gctaacacgg tgaaaccccg tctctactaa aaatacaaaa 50400
aattagccgg gcatggtagc gggcgcctgt agtcccagct gctcgggagg ctgaggcagg 50460
agaatggcgt gaacccggga ggcggagctt gcggtgagcc gagatcgctc catgccattg 50520
cactccagcc tgggtgacag agtgagactc cgtctcaaaa aaaaaaaaaa aaagaaaatt 50580
agccaggcac agtggcaggt gcctattgtc ccagctactt gggaggctaa ggcaggagaa 50640
tggcatgaac ccgggaggtg gagtttgcag tgagccgaga tcatgccact gcgctccagc 50700
ctgggcgata gagcaagact ctgtctcaaa aaaaaaagcc aggcatggtg gtgcatgcct 50760
gtagtcccag ctactcaaga ggctgaggca ggagggttgt tcgacccacg gagatcaagg 50820
ctacagtgag ccatgatcgc accactgccc tccagcctgg gtgacagagt gtgaccctgt 50880
ctcaaagtaa gtaaatagga ggagagacaa gtgggcagtt cagactgatg gtatgggcac 50940
agtagagact ggtgcagaca ggctggcctg tgatgtcaag caacttctgt aactgtttcc 51000
ggcatccatt tgtgtgtcaa tttccgtgtc agtaggaaga ctctgtaggc tgccaagagg 51060
aataagtggg aggatcctcc cagagaggcc gggcctgcag gagggccagt tctcatgagt 51120
tcttatttgg cccctaccct ccaggctgtg gttctgaggt gggagacaga gcctgacctc 51180
tgtttgtctt gttttgtctt tgcagcagcc cacccatgtg cccgtgacaa tggtggctgc 51240
tcccacatct gtattgccaa gggtgatggg acaccacggt gctcatgccc agtccacctc 51300
gtgctcctgc agaacctgct gacctgtgga ggtaggtgtg acctaggtgc tcctttgggg
tgatggacag gtacctgatt ctctgcctgc taggctgctg cctggcatcc ttttaaaatc 51420
acagtccctg tggcatccag tttccaaagc tgattgtgtc ttcctttgcc ctcctttctt 51480
ttctactatg tgcattcggt gctatgaatt ttcctctaag tactgcgttt cctgcatctc 51540
acaaattttg ttacattttc attttcaggt agtttgaata tttttacact tctcctgaga 51600
tgacatcttt ggctcatgtg ttatttagaa gtgttgctta gtttctaaag agttggggct 51660
tttccagctg tctctctgca actgatttct aatttaattc tactgtagtc tgagagctta 51720
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ttttatatga tttctgttat tttaaatgtg ttgggtgtgg tgtttttgtt gttattgttt 51780

ttgtgtcttt ttgttttgtt ttgcttcgtt tgttttgttt ttgagacagt gtcttgctct 51840 

gtcactcagg ctggagtgca atggcgcgat ctcagctcac cgcaacctct gcctcccggg 51900 

ttcaagtgat cctcttgcct cagcctcctg agtagctggg attacaggtg cacgccacca 51960 

tacccagcta atttttgtat ttttagtaga gacggggttt caccatgttg gtcaggctgg 52020 

tctcgaactc ctgacctcgt gatccgccca cctcggcctc ccaaagtgct gggattatag 52080 

gcgtgagcca ctgtgcctgg ccattaggtg tgttttatca cccagcatca tgcagtttat 52140 

cttggtgaat gttctgtgta ctcttgaaaa gaatgtggat tctgctgttg ttgggtggag 52200 

tgttccagaa acatcaatta gatccagttg gttaatagtg ctcatcaggt tgtctctatc 52260

cttccttcct gactgcctgc ttgagctgtc agttattgac aggggtgtgg agtctccaac 52320 

tctaatggtg gatttgttta tttctcctag tagttctatc tttttctctc cttctaccct 52380 

tgatcctctt ctccccctag ggcttcctgg tgttggtggt gggagagtgg ggtagtgaag 52440 

aacctggact ttagggccaa agaggccagg gttcaaatcc tggctctgtc acttcccagt 52500 

tgagtgaccc tggctggtgc ctgaatctct gtgagcctcc acttcctcct ctgtgaaatt 52560 

gagagcactt acctggcagg ctgtcatggg catcaagtaa cagggcactc cacctggacc 52620 

ctgacacgtg atgcacagga atgccagctg ctatgccatg ggtgtggcag tagtaataaa 52680 

gtgaccatct gtatcctcac cacagtgaag cctgtccagg gctttctctc ctatgccccc 52740 

atgcctccag gtggccttgg atcctgttgg ttctgtgctc tgctcagcga cctttctccc 52800 

gtgggagttc ctgggggttc agcttcatcc tacagacagc agcacacact ggctgtgcac 52860 

cctttttttt tttttttttt ttttttttga gatggagtct cgcttttttc gcgcaggctg 52920 

aagtgcagtg gtgtgatctt ggctcactgc aacctctacc tcctgggttc aagtgatttt 52980 

cctgcctcac cctcccaagt agctgggatt acaggctccc accaccacgc ccggctaatt 53040 

tttgtatttt cagtagagat ggtgtttcac catgttggcc aggatggtct tgaactcctg 53100 

acctcaggtg atccgcccac ctcagcctcc caaagtgcag ggattacagg cgtgagccac 53160 

cacacccgga gtgccggttg tttttagcag tttgtcttgt tcctggagag actggctcct 53220 

gcccaggagc tcggggagta gggccgcggg gtgctgcctc acacctcgag tttggccgta 53280

agcagagggg acattttgtg actgtccccc tcctgagctt cccagcagct tttctccaag 53340 

ttacagccca aaagctcagg tggatttgca acccaacggt gtctgtgcac ctcccactga 53400 

tgcccgaact gccctggcca agaaacgggg ccgtcagaac gctgcactaa ctgcagcctt 53460
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gggcctccat gccagaggcc atgcccttcc atccaccacc ccctggcctg ggccctggcc 53520
ctcctggctc gggaactcca ggccccttcc tcacggatcg agagacgtgt atttaccgca 53580
caggtgcttg tcattctctt gtggcctctt ctccagggag atcacagaag gacagggcct 53640
cactgaggtc tcggacatgg accctttgat agtggcagga gccaggctgg gcaagaggcg 53700
gccacagtca cctcagcagt gccatcacca ccgccattca gcccttccct gagccgggcg 53760
cgcccctggc tctggcccca gtgtcccagt tacagctcac aggagcttgt ggtgcccagc 53820
ggctgcttct gattgagagt cgaggtcgga ggctttggga ggctgagagg ctgctcggtt 53880
tcacaactgc tgagggagac ttgggctcca tctcaggtct gccccatgtc gccctcaacc 53940
tccagccacc ggtcctccgt gtcccccatg gccaggcacg gcttgcagac atctgtcgtt 54000
ggctcctctc agccgtcgtg ggctgaccct ggcacgtcct cctgtggctg agcccagtgg 54060
ggacagctgc ttccttttat taccctagaa ctctcgtctt tgatcaggcc ccctccccta 54120
tgccacacag tccctgtcac tcgggtgagc ccagtagtca tggggaaggc ctgcgggttc 54180
caaacatcca aaggcttgcg tgcagcatga cagcttgaaa ccgatgtttt ttaccttgat 54240
cagatttcag cttggcgggg gctttgctca gctttcagtg aggcctgggc cgatttccca 54300
gcatcccctc ctgaggccag cctctgtttc ctgtgatttt ctgcacaaag tgggagggag 54360
gagtcttagg aaatgggggg ccacctcgaa acctaggcct cctctggctt ctctgtgcca 54420
gtgcccccac gctttgtgtc tgtgtcccca gccca£ggga ctgtgttatt ccctgagtgc 54480
tgccgcatgc ccagcccgca ctgaggacgt ggagccccga ggggcaggat ggcctccatg 54540
gtcacacgta ggaagtggcc tccaccctcc gatgatcctc tccccccctc cctttcagcg 54600
ccttccccgg gggtgtcatc agccctcctg cctgtgcttt gtcccgtctt ctgcaggcgc 54660
atgggacgtg ctgacaggtc ctctgccggg ttcctgcctt gctatgcgca cgctggtcac 54720
cacagaggcc tggcccttct tctgtagcag tcccacaccc gcaacaggtg tggctgctga 54780
ccacctgctt tctgcccctc tggtcctgag gagggcgcag tgggcactca ggcgtggctg 54840
agcagatgtg tgttgccggg aggaggaagg actgctccag tcagggctga atttcccacc 54900
cggagcattt ctgctgtatt tggtgtagcg cctgctgctt aaagctctga ttcccagttg 54960
gcaccctttc ccttctgcat tgaaaaacat acggatgcat gtcttcttgc agtgaatgtg 55020
tattctccca gcctctcttc tgggttgggg ctggaggtgg agcggcacac aggagccgca 55080
gcgatggagg atgtgcgggt gcagcacccc gtacagcagg gatgccaaac ccgcgctgag 55140
tccctctcaa cttctgcttt gaagcccagt cacgccattg cctgggtttt gctgggcggg 55200
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gctgcatgtg atgttctcct ctgtccctcc cccagagccg cccacctgct ccccggacca 55260 

gtttgcatgt gccacagggg agatcgactg tatccccggg gcctggcgct gtgacggctt 55320 

tcccgagtgc gatgaccaga gcgacgagga gggctgcccc gtgtgctccg ccgcccagtt 55380 

cccctgcgcg cggggtcagt gtgtggacct gcgcctgcgc tgcgacggcg aggcagactg 55440 

tcaggaccgc tcagacgagg tggactgtga cggtgaggcc ctccccgtca aggctctgcc 55500 

aagaccctgg ccctgccctc cgggatacga gcttggggct gcctccggcc tcacaggagt 55560 

aggggctctg aaaacctttg cttgcaggga gattgccaag tctgtctttt aggcccaaca 55620 

aggaaaactc tgcagttcca cccatcctgt cccaccaggt agtgtggctt gaaggcagac 55680 

tgtgagggtc tatctcacct tcctgcatta ggtcaggagt ttcacagaaa cctgaggcac 55740 

attcaggggt gggctgcaga ggtccatggc tcacaccctg gaaaatccgc ccccaaaaga 55800 

cagtgctgtc tccactgacc agtctgtggg atagtgctta agcctgagtg gtttctatca 55860 

acatgtagaa tcaggaggta taaagagatt tgctcaggca tcctgggccc tctctgacca 55920 

gcaggatctt cctttagatc ttgacagtga aacacatctc ttctgtgccc cctgtgagtt 55980 

ttctttcatt cattcattca ttcattcatt cattcattca ttcgagacag agtcttgctc 56040 

tgtcacccag gctggagtgc cctggtgtaa tctcggctca ctgcaacctc tgcctccagg 56100 

gttcaatcga ttctcctgcc tcagcctccc gagtagctgg gatgacaggt gcgcaccacc 56160 

atgcctggct aatttttgta tttttagtag agacagggtt tcaccatgtt ggccaggctg 56220
gtctcgaact cctgacctca ggtgatccgc ccgcctcagc ctcccaaagt gctgggatta 56280
caggcatgag ccaccgcgcc cggcctgagt tttcctttta tgaaggacct gcttggttgg 56340
ttgcctgcca catgttgtca gcaccatggg cccaggactg ctgaggagct gttgatgccc 56400
tcgctctccc agagccaccg gctctgttag ataattcaca tgcagtctgg ccactgtcct 56460
acgtcctcat tcacaaagag cagacatttc gtagaagatg agggcctggg agtaacctcc 56520
ctgcatgttt ttctataaag gcatagtggt taagtccttc cagctcattg accattggag 56580
aattttatgg aggctgtaga ctaggggctg gtaaactaag ggcccagggg ccaaatccag 56640
cctgccacct acttttgtaa ataaagtttt cttggtgcac agccatgccc attcattcat 56700
ttgcacaatg tctgtggctg ctttcatgcc aaaagcagga gaactgagtg gttatgctgg 56760
agacctacgg ccttcaaagc cccagacctc acgtctggcc cttgacagac agagcttccc 56820
cagccctgct gcgcatcctg gcccagcatg tgctgtgtgt gtgatttcag cttgcaggag
ccgtggttag gaattgtccc tgtgttggtc cattttgcat tgctatgaag gagcacctga
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ggccgggtag attatgaagg aaagaggtct gtctggctca tggttctgta ggcagcacca 57000 
gtatggcacc cgcatctgct cagcttctag tgaggtctca ggaagctttg actcatggtg 57060 
gaagtcgaag cgggagcagg tgcatcacat ggtgagagag ggagcaacgg agagagagag 57120
agagagagag agagcgcctc tccctcttgc cctcaccttg agaggagatg ccaggctcct 57180
ttaagtaacc agctcccatg tgaactcaca gtgagagccc atttgctact gcggagaggg 57240
caccaggcat ctgctcccat gacccaaaca ctgcccacca ggccctacct ccaaccttgg 57300
ggtcatattt tattctgttc tatgctatgc tatgctatgc catgccatgc catgccatgc 57360
tattcctatt ctattatttg agacagaatc tcgctctgtt gcccaggctg gagtgcagtg 57420
gcatgatctt ggctcactgc aacctccacc tcccaggttc aagcgattct cctgcctcag 57480
cctcccgagt agctgggatt acaggcacac accaccacac ccgggtaatt tttgtatttt 57540
caatagagat ggggtttcac catgttggcc aggctggtct caaactcctg gcctcaagtg 57600
atccacttac ctcggcctcc caaagtgcca tgattacaga tgtgagtcac tgcgcccagt 57660
gagggtcaca tttccgttga gatttggagg ggcagacgtt ggagccatct gagccccctc
gtcccgctct agcttctcct cccgtgtgcc ccgcggtgct ggtggcaggc ccttacgccg
gttctggctg cacgctctgt tccagaagct ttcttccctg cttggttacc agaaaatcat
cccatccatt acaaggacag ggtcccctta tctcccattc ccagggcagg acaccggggg
cagggcaggt ggggaactga gcaagttctc tgggggcagg cgtggctatg gctccctctg
ggtgggcgtc tggggagggg tggaggcagc cgtcagcgcc ctggcttgct cttcctccct
ggccagagac tgtggccttg tgctgctccc gtgtgggctg cctgcacctc cagtgggttg
tgctccctcc cctcccctcc cctcaagctc tgctgagcac cactgccttc cacagccccc
actctcggga ggcgaggctc ctcgtggcca ttcctgtcct tggcacccac ccccccacca
acctggtaga gccttgggcg gggtctgtta ctccttgcat ggcgtagacc tccccacagt
aggcacctga cacatacctc ctggggggca ggcaggaggt gcgttgaggt ctcagccctg
gcagtccctc ccctgcgtgg cataggcctc gccacagggt catcgagggt gggtggagac
tgtactagac cactccccgc tggtcctaga aagggtccca tctgtctgct ctctgtttgg
agtccagacc ttggttgctg tgccctgcat ggtgggctgg ggggcaccct ccagcctctc
tgagtgcatg gcctctcctt gcagccatct gcctgcccaa ccagttccgg tgtgcgagcg
gccagtgtgt cctcatcaaa cagcagtgcg actccttccc cgactgtatc gacggctccg
acgagctcat gtgtggtgag ccagcttctg gcacggggaa ggggcgtccg ggctgggttc
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ccccaggaac gtggagttta ggggaggaga cgtgcctttc cagcggggct gggggctgtg 
tgggagactc aggcggctgg gaggctcctt gcgggaggca gggaagcctt tcccagggca 
gcggccagga ggacagactg tgagctgtgg gctcggcggc tacagagtct gcctcagtgg 

gcggggctga tggtgtccag gtgcctgcag cacgcaccca cccacgggac cttgctgagc 58920

agcgtctgtc aggcagcaag attacccgag ggctgcagtg gtcctgttcc ctggcagctt 58980 

actgtctggc tgaggaggag tgatgttcac atatgcacac atgtcatgtg cacacacatg 59040 

tacatgacaa catcccacat gctcctcaaa tagcatgacc tgtacagtca cggatatagg 59100 

gcctagggga taggaggcca agacagtcag ggaagacttt ccagaggcag tggctcctga 59160 

aaggctgtct gattcaggca ggaagggagc tgagttcaga taggaagtag caatgagtca 59220 

ttgtgtctgg ggacatggcc actccttcgc tgcagaggga cctgggctga gagctcctct 59280 

cttatggctg cagtcgggag agaagtctgt tggggggaga agggggcttc ctcaagggac 59340 

tccctgtgcc ctttggcacc ttcgtgccag gtcaggcttg aggcctgaag gcagtggtgg 59400 

gggccaccaa gggtcgcctc ctctgctggg caagttccca gtctgacggg cctgtgccgt 59460 

gggccccagc tgtgggggcg ctgttgatgc gcagccaggc ctcgccgcca gagcccgcac 59520 

gcttccattc cgctgacttc atcgacgccc tcaggatcgc tgggccggcc ctgtgggaga 59580

gtgaatgtgg cttttgccaa agttgagtct ggagcctgga aacttcccta tgggcagcct 59640 

tgatagtgga gtggcccaag gagcccaccc agccgaccct gcccctcccg tggctggtgg 59700 

gcggcaccag gggctgcctg gctttgctcg ttcaccaaca tcacccgggc tggccagggc 59760 

gcgctcactt ctgccaccac cgagggccct gggcgaagga gtgaatacca ggctgccttg 59820 

gcagggatgt gttgagggct gtggggagtc ggacagcggc gggggtcaga ggaggaggag 59880 
ggtgcaccgt gcaggctgaa gggccacgtt accctgaggt tggccaggct ccccaggcct 59940
agcctcccag ctcccccact ttctccccac cctccaccag tggcaaagcc agccccttca 60000

gggcgcacgg tgtctgcccc caaggagggc ccattccgtt ggggttaatg ttggccacct 60060
ctttctgttt gtctctggca gaaatcacca agccgccctc agacgacagc ccggcccaca 60120
gcagtgccat cgggcccgtc attggcatca tcctctctct cttcgtcatg ggtggtgtct 60180
attttgtgtg ccagcgcgtg gtgtgccagc gctatgcggg ggccaacggg cccttcccgc 60240
acgagtatgt cagcgggacc ccgcacgtgc ccctcaattt catagccccg ggcggttccc 60300
agcatggccc cttcacaggt aaggagcctg agatatggaa tgatctggag gaggcaggag

agtagtctgg gcagctttgg ggagtggagc agggatgtgc taccccaggc cctcttgcac 60420
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atgtggcaga cattgctaat cgatcacagc attcagcctt tcccactgag cctgtgcttg 60480
gcatcagaat ccttcaacac agaggcctgc atggctgtag caacccaccc tttggcactg 60540
taggtgtgga gaaagctcct tggacttgac cttcatattc tagtaggaca tgtgctgtgt 60600
tgtccacaaa tcctcatgta ccctagaaat gaatgtgggg gcggctgggc tctctccaga 60660
gctgaaggaa tcactctgta ccatacagca gctttgtctt gagtgcagct gggatttgtg 60720
gctgagcagt tacaattcct acgtggccca ggcaccagga acgcaggctg tgtttgtaga 60780
tggctgggca gccgcaccgc agagctgcac catgctggtt tgtatcacat gggtgaccat 60840
ggtatgtcta agaaggtgga gtccctgtga ggtctgcagg tgcccccaca gctccaggcc 60900
accttgagga ttgcctctgc ctgcccagcc ctgagttccc tctcccctgt cctgtcccac 60960
tgtcacccca agccggcctc attgggagcc tgttggatgg cagggtatag atgtaacctg 61020
attctctctg gggagcgggg ttatctggct tctcaagagc tcctaggagc ccacagtggt 61080
ggcaccatca cagtcgcagc agcccccaga gaacgcggcc ctgtctgttc ctggcgtgct 61140
ctgtgctgcc ccgcctgggt tacctgcccc agtcgcaggc cccttggagg aggtaccatg 61200
tgtctcccgt ttcacagatg agccccgggg agctcactct agtagtggcc agagaggcct 61260
gcggctcagg gagcggggca catttccaac aggacacacc gccctggtct gagtctcgtg 61320
ggtagtggga gcagaggaga gcgccctatg tctgtggggc ggcttggctg agcctggaag 61380
ccacctgacc tcccccgtcc cttccctgcc aggcatcgca tgcggaaagt ccatgatgag 61440
ctccgtgagc ctgatggggg gccggggcgg ggtgcccctc tacgaccgga accacgtcac 61500
aggggcctcg tccagcagct cgtccagcac gaaggccacg ctgtacccgc cggtgagggg 61560
cggggccggg gaggggcggg gcgggatggg gctgtgggcc cctcccaccg tcagtgctgg 61620
ccaccggagg cttcccgggt tcctgggggc tgtgccaccg cctctgaggc atgcttgctt 61680
tcttcccttt tcaaaccctt ctgcttcctt ctttaatgac attgttgatt gtggataatc 61740
tgaaaactac acaaaaatat aaagagccaa aatctcaccc aaatccacct cctagagtgg 61800
ctgttgggct ccgtcagcat ccaggcggcc gtctgtgttc cgcacggccc agcccatcga 61860
tagccgcctg caccaggcct gtctgccctc tgtgagcctc cccacagggt tccctccaca 61920
aacaccctgt tctcccaccc agggctggct gcttcctgga aaacagctgg atggttttgt 61980
gcatgacaga caaacacagg gtgattttcg tggctaaaat actccctgga gcttttggca 62040
gggtgagggg ctggctccag ctgagccacg ccttgagtga aatgactgtg aggagaataa 62100
actgccgctg ccctccagga tcactggggc tggctgggga gaacccccgt ttctgggagc 62160
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acagtcccag gatgccaagg cgagcttggt gccgagatgt gaactcctga gtgtaaacag 62220 

cgggggctga cttgacatgc tttgtatgct tttcatttgt tcctgcagct gtatgcccct 62280 

aaggtgagtc cagccccctt ctgcttcctc tggggcctcg ccagtgagcc ccaccttgct 62340 

ggggctggtt cctcctgccc ttctgggtat ccctcacatc tggggtcttg tcttcttgtt 62400

ttatttttct tttttttttg agacggagtt tcacttttgt tgcccaggct tcagtgcaat 62460 

ggtgtgatct ctaggctcac cgcaacctct gcctcccagg ttcaagcagt tctcctgcct 62520 

cagcctccct agtagctggg attacaggca tgtgccacca cgcccagcta attttgtatt 62580 

tttagtagag atggggtttc tccatgttgg tcaggctgat cttgaactcc ctacctcagg 62640 

tgatccgccc accttggcct cccaaagtgc tgggattaca ggcgtgagcc accgcacctg 62700

gcctttttct tttcttttct tttctttttt ctgagacagg gtctcgctct gtcacccagg 62760 

ctggagtgca atggtgtcat catggctaac tgcagcctct accttctagg ctcaagcaat 62820 

cctcccatct cagcccctaa gtagctagga ctgcacgcat gcatccccat gcccagctaa 62880 

tatttacatt ttttgtagag atgaagtttc actatattgc ccaggctggt ctccaactcc 62940

tggactcgag cgatcctcct gcctcggcct ccccaggtgc tgggattaca ggcgtgagcc 63000 

accgtgcctg gcctggggta ttgtcttctt atggcacctg actgtggtgg gccctgggaa 63060 

ggaagtagca gaagagggtt cttcttggtt tcctggacag taactgagtg ttctggaggc 63120 

cccagggcct ggctttgttt agggacaaag ggaactggta accagaagcc gagagtttaa 63180 

acacccactg cccttcttcc ctgctcctgc tgctgcaacc cagcttaacc agccaggagt 63240

gctaggaacc caagcagggc ccccgagcac acagcaggca gctcacgaat tctcttttcc 63300

tgttctccct tgggagctgg gaggatctta atcaggcaat aagagatggc actgagcagc 63360

cagctaattt tttaaatcac tttattgttt aaccatatga ctcacccact taaaaaaggg 63420 

tacagttcag tgggttttag tgtattcaca gatgtgtgca accctcacca cagttaattt 63480 

tagaacattt tcctgcccct aaaagaaact ctgcatgaag ccagctgttt ttaaattagc 63540 

aaagttattt tgcatccttt aaatatatgt tcatggtaca aaattcaaaa gatacagaag 63600 

agtctgcagt ccaaagagac tccgccccca tgacgccaag caggactccc tgggaggcat 63660 

ggcctcctgc agtgtgtttc ttctatgtcc ccccaggggt catctgtaca tatgcaagca 63720 

tacaagagcg tggactttgt tttccaagcc agaagataat tgtagattta tgtgcagttg 63780

tgagaaagag cacagaccca tttatcctct gcctggtttc ccccagtgct gcctgccatc 63840 

ttgcatgact tccattccta tcataagcaa gacactgata acgattcttt caccttattc 63900
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agattgacat aagtgttttt tgtttgttct tgagacaaac ttcctctgtc acccagtggg 63960
agtgcagtgg cacaatcaca gctcactgca gcctcaaact cctgggctca agcgattctc 64020
ctgcctcagt cccctcaagt agctcagatg gcaggtgtgc accatcatgc caggctaatt 64080
tttaaatttt ttgtggaggt gaggcctcac taaatttcct gggctagtct tgaactcctg 64140
agctaaagtg atcctcctgc ctcagcctcc caaagtggta ggattacagg catgagccac 64200
tgcgcctggg ctgacatatg tgttttcgta agcccgaaag atagcatctg aagagtcaac 64260
attgagcctt gccttttgct gctaatgatg tataaaagct gctgttctga gcatttcgga 64320
ggctcccagc tgccgtgtgc accctgccta gagctctacc gtaacccatc tccgggagga 64380
ggtgctattg ttttcctcat tttgcaacaa ggaggctgaa gaactgagca tgaaccactg 64440
gcctgggtcg ttcggttggt aggcagtggg gccaggccat ccaactcaca accaccttct 64500
actctgcttc ccccgcaccc tgaagtttgt tctgttttga ggacacagcc gtcacattct 64560
tggtggctga acagcactcc ttgtcaggtg tggctgggcc cccactggag ggcatcatgg 64620
tcctctctcc tgctgcggtt gaaccttggc tgtttcaacc actcctgcca agtggccctc 64680
tgaaagggac agtccatctt ttctcagcag agggccacac tggcaaaacg gtccctggca 64740
ccctttctct ccacctgtct aatatagagt aaaaatggta tcatgttaag atcttcattt 64800
atatttattt tatcatgaat gatgtaagca tcattttgtg tgtttaagaa cctttgggcc 64860
cagcgtgatg gcttgcagct gtaatctcag cactttagga ggctgagatg agcggatcac 64920
ttgaggccgg gagtttgaga ccagcctggc caacatggag aaaccccgtc tctagtaaaa 64980
atttaaaaat tagccgggta tggtgatccc agctacttgg gagtctgaag catgagaatt 65040
gcttgaacat gggaggcgga ggttgcagtg agccgagatc gcgccattgc actccagcct 65100
gggcgacaga gcgagactct gtctcacaaa aaaaaaaaaa aaagaaaaga aaagaaatta 65160
tcaatctcct cttttatggc atatatatat atatatatat atatatatat ttatttccct 65220
ttcttggtta tgttcataaa ggcctcccct gctctgatca taaaaaacaa cttattttca 65280
cactctctct cttttttttt tgagacagag ttttgctcct gttgcccagg ctggagtgca 65340
gtggcgcaat ctcagctcac tgtaacctcc gcctcccggg ttggagtgat tctcctgcct 65400
taccttcccg agtagctggg attataggca tgcaccacca tgcctggcta attttgtact 65460
tttagtagag acgggggttt ctccatgttg gtcaggctgg tctcgaactc gcgacctcag 65520
gtgatccacc cacctcggcc tcccaaagtg ctgggattac agacgtgagc caccatgccc 65580
agcccacact ctctttctta acgtcctcct cctttcgttt tacgttcaca tctttaattc 65640
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ttctgggatg taattagatt tgatgagcaa ggtgggcatc cagcttgttt cttggctgat 65700 

ggcttatggg tggcgtgaat tagtcggggt ctatcaggag gcagaaactc tatgagaatt 65760 

tgaacagaga aagttccgtc tacaggctta ttaccaggga ctggaatagc agaaattgaa 65820

cagtgagatg tacagagaac tctaagaatg caggaatagg ccaggcatgg tggctcacac 65880

ctgtcatccc agcactttgg gagaccaagg cgggtggatc acctgaggtc aggagttcga 65940

gaccagcctg gccaacatag tgaaacccca tctctactaa aaatacaaaa aaattagctg 66000

ggtgtggtgg cgcatgcctg taatcccagc ttctcgggag tctgaggctg gagaatcact 66060 

tgaacctggg aggcagaggt tgtagtgagc cgagatcatg ccattgtact ccagcctggg 66120

caacaagagc gagactcagt caaaacaaca acaacgcagg aatagcagat gagccgaggt 66180 

ggggcctccc cagcccccac cccccacccc gcaccctggg ccgagatcca gtcctctttg 66240 

aatagggcct gggcgtggtt cacgggacat ctgagacatt gccgaggcgc tgcactggtg 66300

gatcttgcca gaagtctgcc cagtgcagat ttgggcagaa tctcaaactg ccttgggatg 66360

taggagagaa accaggcctg gtcaagttca tgggaagagg tggaaacaga ccccataggc 66420

tggggcttgg gcagctgtag gaagccctct ctgctgcctc cctgcctgct ctctgctttg 66480

aagcatcttc cccagtgccc ccagtctcat gccctctcaa cgttggggtc aaatcctgag 66540 

gaatacccag actggctctc tgggccaaag aggaccctct ccagaaagag cagggcccag 66600 

tgcggcttcc taaagggcag gggaagggcc tggccactcc ccagaggcta ctcaccagcc 66660 

atcaggatag ccccaggaag caggccttct cgagcccatt ttattacttt attttattat 66720

tttatttaat tttaaattta ttttttgaga cagagtctca ctctgttgcc caggctggag 66780

tgcagtggtg cgatctcaac ccactgcagc ctctgcctcc agggttcaag ggattctccc 66840 

acctcagcct cccaagtagc tgggattaca ggtgcccgcc accacacccg gctaattttc 66900 

atatttttag tagagacgag gtttcaccat gttggccagg ctggtctcga actcctgacc 66960 

tcaagtgatc cgcccgcctc ggcctcccaa agtgctaggt caagcccatt ttaaagttga 67020 

agaaactgag gctgaggtaa attccctccc cagggatcct gctgcagcca gaaggtggta 67080

aaacaggact tcacccgggt ctgtctggcg tgaaaggcag tgttcttgta ccaccctagg 67140

gggcctgaga gaactgagtc cctcgggcat aactgacagt tctgttccca ttattccgca 67200

ggggctcgga tctggctgta tgctttccag gatggccttg gagacccaca taagccctac 67260 

accctttggg aagctgcatg ttgggttggg gtgccgtcag tggcacttgt ggaaggtgca 67320 

gacctgtgtg ggtgtgtggg cccagggccc ctggtccctt cctccctttg tagggctggt 67380
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tgtgtgctgc ctggacctgg ggggcacgtt cacgtggtga atttgtctat ttactatccc 67440 

cgctttgggg ctggtgccag cacaggccct tgtgaagggg gtgcctttgt ctggagtggg 67500 

actgtggccc ctccctcagc gtggtgactt ctgtgtcagg gcttcagcag ggacgcagag 67560

cccctgagtg ttcggaacaa gggcgtcatt gcaggagtta gactgtgtgt gatggaggga 67620

ggaggggcag gaggaaaggt cagaaggaga gttcctggga aggtccctga ggagcctggt 67680 

gaggtgctaa ctggtgtgga ggacactcag ggcctgtggg gacatctcct actgctgggg 67740 

gccagccaca aagggaactg gccgaagtcc tgtccccgcc ttcacagccc agcatctggt 67800 

cacaaggcag gtacttggaa gggcgcgggc acctgggcca aaagtgcctg ggttcccttt 67860 

gcctttcact gagatgacct tcggggcagg tggctgctgc ctcccctcct gtccccaggt 67920 

tttgccaact ggccagagga aggggtcctg ggaagcaggg gggccagaag ccctctctgc 67980 

aaggaaagcc cgaggggtgt gggaggaagg aaggaatgcc caggctggcg aggctctaag 68040 

tcaccctggc ttggctctcc tcagatcctg aacccgccgc cctccccggc cacggacccc 68100 

tccctgtaca acatggacat gttctactct tcaaacattc cggccactgc gagaccgtac 68160 

aggtaggaca tcccctgcag ccctccatgg ccattgggtt cccgccagcc cgtggtggag 68220 

gggcctaatc cccatgccac tgatgagggg aggtattctg ggtgctagtg ggcaggtgcc 68280 

gggcccagcc ctgcctccct ctgctctgcc aaccacacta ggctgcctcc ccagacaagc 68340 

tcagcgggca ctgcatgttg ggttcagaaa tcagcagaac tccacgttct gagctgctct 68400

tcaagttgct cctatggggg ttacttttaa gctgggaaat ggctgtggcg tcgaggggcc 68460

gggggcttgg gctccaaact ctgactgtgt gtttgagtcc ggctgtggaa acctagccat 68520

tgagatgccc cctcttggtg gctctgtcct cttaggatgg gacaagtctg tgaaggctgc 68580 

tgcagcaccc accgtagacc cctaatcgtg tgacgtcacc aggatggtcc gggctgctca 68640 

cttgccacag tggcctgttt gagcccggga agccaacggg gctgctcagc tggacaccag 68700

ccccccgagc tgcccatgtt ggggtcacag gccccacctc cctggttggg gaggggcaac 68760 

tgagagtgtg gagaggtggg acccaggtgt gctggtctcc gcaggggctg gatcagagcc 68820 

tgggatgggc agggtgagcc tcctgacctt taacccagtg gtgtcaggca acgtggccca 68880

cccgccagcc gcaccaggcc ccacccccgc aggtgaaggg gtgggatagg ctgggcctgg 68940

gccaggacac ctctggacca cgcattcctc attgcttggg tccctggagc agcagggcct 69000

cccgagtgtg gtgccgcctg ccacctagtg gccatttcca cgaactccca ggcctggctg 69060

gggagccgga actgcagcct ccatttccac cccactccgg gtcgggccac ctccctgatg 69120



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cctcagtatt atatcaaact gtcacagtct gtcccacagc cttacagacc actgtctcca 69180
gaatggtcac atccacactg ggcagcccag tctcgctagt tcctcgtccc acctcctgcc 69240
tttgctcatg cccgtcctgc tctgggccca ccgcggacac atcttccccc cgcccgccgt 69300
ctgacctcac agcagctggg ccccaagagg agtatcctgt cctgctgcac ttttctcaac 69360
acccggtgtt ggctgcacct tcccacccat tgcaggcccc tctgtgacag gacgggggct 69420
cctaaacaca ccacagttcc gagtctgaac tcacacagtg ggatgcggcg tttctgggcc 69480
acagttgggt gcaggtagcc tctgggagga tgggaggtca ggagccatct tgcgagtcag 69540
gttgcttgaa ctcaggatgg aagtgttccg ggcccattgg ttgctgtatt agcctgttct 69600
cacgctgcta ataaagacat acccaagact gggtaattgt aaaggaaaga ggtttaacgg 69660
actcacagtt ccacctgcct ggggtggcct cacaatcatg gtagaagaca aggaggagca 69720
agtcacatct tacatggctt cagggaacag acagcatgag aaccaagcga aaggggtttc 69780
cccttgtaaa accatcaagt ctagtgagat ttattcacta ccacgagaac agtatggggg 69840
gaaccacccc catgattcaa tcatctccca ctgggtccct cccacagcac gtgggaatta 69900
tgggagtaca attcaagatg agatttgggt ggggacacag ccaaacccta tcggttgcca 69960
acatttacag taacagtgtt aggtgaacag ttgtccagtc tcctgttttg tcggacactg 70020
tttctagcac cttccaggca gaatctcatg tatccttcac tttcgaaatg ggtactattt 70080
catccccact tttatcaatg agaaactaaa gctcgaagag gtcaagtaag ttcctggcca 70140
aggtcagcta gcaggctcta gaggcctcgt tctccttaga ggcagccttg ccagggccca 70200
ggcttggcag gctgcagggc aggtgcgggc atgcccatgg tagaggtggg accattgagg 70260
ctcagagagg gtaagtgatg agccctggcg acacagcggg gtgggtccag agtccggcct 70320
gcatcttctg gagctggcca gtggacaggc ctttcccgtt cacagccccg gggctgctgt 70380
gcccaccagg gcggatgtgc ctaccgaatc ccactcctct gtgtgtgtcc ctttcaggcc 70440
ctacatcatt cgaggaatgg cgcccccgac gacgccctgc agcaccgacg tgtgtgacag 70500
cgactacagc gccagccgct ggaaggccag caagtactac ctggatttga actcggactc 70560
agacccctat ccacccccac ccacgcccca cagccagtac ctgtcggcgg aggacagctg 70620
cccgccctcg cccgccaccg agaggagcta cttccatctc ttcccgcccc ctccgtcccc 70680
ctgcacggac tcatcctgac ctcggccggg ccactctggc ttctctgtgc ccctgtaaat 70740
agttttaaat atgaacaaag aaaaaaatat attttatgat ttaaaaaata aatataattg 70800
ggattttaaa aacatgagaa atgtgaactg tgatggggtg ggcagggctg ggagaacttt 70860
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gtacagtgga gaaatattta taaacttaat tttgtaaaac agaactgcca ttcttttgtg 70920 
ccctgtgtgc atttgagttg tgtgtccccg tggagggaat gccgaccccc ggaccaccat 70980 
gagagtcctc ctgcacccgg gcgtccctct gtccggctcc tgcagggaag ggctggggcc 71040
ttgggcagag gtggatatct cccctgggat gcatccctga gctgcaggcc gggccggctt 71100
tatgtgcgtg tggcctgtgc cgtcagaaag ggccctgggc ttcatcacgc tgttgctgtt 71160
cgtcttcctc agattcttag tctttttttt tttttttttt ttttgagacg gagtctttct 71220
ctgtcatcca ggctggagtg cagtggtaca atctcagctc actgcaagct ccgactccca 71280
ggttcaagtg agtctcctgc ctcagcctcc cgagtagctg ggactacagg tgcgcgccac 71340
cacacccgcc cagctaattt ttgtattttt agtagagatg gggtttcacc atgttggcca 71400
ggatgatctc gatctcttga cctcgtgatc cgcccacctc ggcctcccaa agtgctggga 71460
ttataggcat gagccactgt acccagctga ctcttagtca cttttaagaa ggggactgtg 71520
ccttcatttt tcactgggcc ctgcagaata tatgcctggg ctctgggctc ttctgaacct 71580
gtgttggctt ccatctgacc tctctgtgcc agcccaaggc tgctgctctt cctgagggca 71640
aggagcccca tgactgcgtg ttgactcgct ggatggggct gctgagccca ctctgccaca 71700
ccacgtgccc ctggcaggga gggaatccct gggtcctcac aggaacagtc agcaagccac 71760
acctgacgcc tgctgtgggc ccatccctgc ggtgctggag aagacagaca aggcctggtc 71820
actgcctctg cagggtcccc agtccgtgga aggagacagt aatctaggca ttttcggtgg 71880
ggaagctgag ctgttctcgt gtcctgaagg ccaggcggga acagccgtct tcagagggaa 71940
gggagaaaat gcacatcgca tcagtggaga agggcctgac ttccctcagc atggtggagg 72000
gaggtcagaa aacagtcaag cttgagtatt ctatagtgtc acctaaata 72049

<210> 10

<211> 8705

<212> DNA

<213> Homo sapiens

<400> 10

ggactcaggg gcagcaggga ggtacaccca tggttagtgg gcggaccata gggggtaatg 60
agagggtgaa tcgatggaac ctgggggaca caatcgaagt ggttccagag tcgggctgta 120
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ctaattaaag agacggggca gtggacaggc attttcagtt gactgcccag ggagtgttct 180 



gcccaacagg gaggatatgc gtacagaatc atactcgatc agcatgagtc caattcagac 240

cgtacatcag tggagatatg ggtcccccga tgactccgtg gaacactgat gtttgtgaca 300

ggggagtaca gcaccagcca tcagcaggcc agtaaatcat accggcctgc gaaattggac 360

tcagacccgg atccaccctg accgacgtcc caagccccca ccccccaccc cccaccatgg 420

gccgagatcc agtcctcttt gaatagggcc tggccgtggt tcacgggaca tctgagacat 480

tgccgaggcg ctgcattggt ggatcttgcc agaagtttgc ccagtgcaga tttgggcaga 540

atctcaaact gccttgggat gtaggagaga aaccaggcct ggtcaagttc atgggaagag 600

gtggaaacag accccatagg ctggggcttg ggcagctgta ggaagccctc tctgctgcct 660

ccctgcctgc tctctgcttt gaagcatctt ccccagtgcc cccagtctca tgccctctca 720
acgttggggt caaatcctga ggaataccca gactggctct ctgggccaaa gaggaccctc 780
tccagaaaga gcagggccca gtgcggcttc ctaaagggca ggggaagggc ctggccactc 840
cccagaggct actcaccagc catcaggata gccccaggaa gcaggccttc tcgagcccat 900
tttattactt tattttatta ttttatttaa ttttaaattt attttttgag acagagtctc 960
actctgttgc ccaggctgga gtgcagtggt gcgatctcaa cccactgcag cctctgcctc 1020 

cagggttcaa gggattctcc cacctcagcc tcccaagtag ctgggattac aggtgcccgc 1080 

caccacaccc ggctaatttt catattttta gtagagatga ggtttcacca tgttggccag 1140 

gctggtctcg aactcctgac ctcaagtgat ccgcccgcct cggcctccca aagtgctagg 1200 

tcaagcccat tttaaagttg aagaaactga ggctgaggta aattccctcc ccagggatcc 1260

tgctgcagcc agaaggtggt aaaacaggac ttcacccggg tctgtctggc gtgaaaggca 1320

gtgttcttgt accaccctag ggggcctgag agaactgagt ccctcgggca taactgacag 1380

ttctgttccc attattccgc aggggctcgg atctggctgt atgctttcca ggatggcctt 1440

ggagacccac ataagcccta caccctttgg gaagctgcat gttgggttgg ggtgccgtca 1500 

gtggcacttg tggaaggtgc agacctgtgt gggtgtgtgg gcccagggcc cctggtccct 1560 

tcctcccttt gtagggctgg ttgtgtgctg cctggacctg gggggcacgt tcacgtggtg 1620 

aatttgtcta tttactatcc ccgctttggg gctggtgcca gcacaggccc ttgtgaaggg 1680 

ggtgcctttg tctggagtgg gactgtggcc cctccctcag cgtggtgact tctgtgtcag 1740 

ggcttcagca gggacgcaga gcccctgagt gttcggaaca agggcgtcat tgcaggagtt 1800 

agactgtgtg tgatggaggg aggaggggca ggaggaaagg tcagaaggag agttcctggg 1860
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aaggtccctg aggagcctgg tgaggtgcta actggtgtgg aggacactca gggcctgtgg 1920
ggacatctcc tactgctggg ggccagccac aaagggaact ggccgaagtc ctgtccccgc 1980
cttcacagcc cagcatctgg tcacaaggca ggtacttgga agggcgcggg cacctgggcc 2040
aaaagtgcct gggttccctt tgcctttcac tgagatgacc ttcggggcag gtggctgctg 2100
cctcccctcc tgtccccagg ttttgccaac tggccagagg aaggggtcct gggaagcagg 2160
ggggccagaa gccctctctg caaggaaagc ccgaggggtg tgggaggaag gaaggaatgc 2220
ccaggctggc gaggctctaa gtcaccctgg cttggctctc ctcagatcct gaacccgccg 2280
ccctccccgg ccacggaccc ctccctgtac aacatggaca tgttctactc ttcaaacatt 2340
ccggccactg cgagaccgta caggtaggac atcccctgca gccctccatg gccattgggt 2400
tcccgccagc ccgtggtgga ggggcctaat ccccatgcca ctgatgaggg gaggtattct 2460
gggtgctaat gggcaggtgc cgggcccagc cctgcctccc tctgctctgc caaccacact 2520
aggctgcctc cccagacaag ctcagcgggc actgcatgtt gggttcagaa atcagcagaa 2580
ctccacgttc tgagctgctc ttcaagttgc tcctatgggg gttactttta agctgggaaa 2640
tggctgtggc gtcgaggggc cgggggcttg ggctccagag tctgactgtg tgtttgagtc 2700
cggctgtgga aacctagcca ttgagatgcc ccctcttggt ggctctgtcc tcttaggatg 2760
ggacaagtct gtgaaggctg ctgcagcacc caccgtagac ccctaatcgt gtgacgtcac 2820
caggatggtc cgggctgctc acttgccaca gtggcctgtt tgagcccggg aagccaacgg 2880
ggctgctcag ctggacacca gccccccgag ctgcccatgt tggggtcaca ggccccacct 2940
ccctggttgg ggaggggcaa ctgagagtgt ggagaggtgg gacccaggtg tgctggtctc 3000
cgcaggggct ggatcagagc ctgggatggg cagggtgagc ctcctgacct ttaacccagt 3060
ggtgtcaggc aacgtggccc acccgccagc cgcaccaggc cccacccccg caggtgaagg 3120
ggtgggatag gctgggcctg ggccaggaca cctctggacc acgcattcct cattgcttgg 3180
gtccctggag cagcagggcc tcccgagtgt ggtgccgcct gccacctagt ggccatttcc 3240
acgaactccc aggcctggct ggggagccgg aactgcagcc tccatttcca ccccactccg 3300
ggtcgggcca cctccctgat gcctcagtat tatatcaaac tgtcacagtc tgtcccacag 3360
ccttacagac cactgtctcc agaatggtca catccacact gggcagccca gtctcgctag 3420
ttcctcgtcc cacctcctgc ctttgctcat gcccgtcctg ctctgggccc accgcggaca 3480
catcttcccc ccgcccgccg tctgacctca cagcagctgg gccccaagag gagtatcctg 3540
tcctgctgca cttttctcaa cacccggtgt tggctgcacc ttcccaccca ttgcaggccc 3600
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ctctgtgaca ggacgggggc tcctaaacac accacagttc cgagtctgaa ctcacacagt 3660
gggatgcggc gtttctgggc cacagttggg tgcaggtagc ctctgggagg atgggaggtc 3720
aggagccatc ttgcgagtca ggttgcttga actcaggatg gaagtgttcc gggcccattg 3780
gttgctgtat tagcctgttc tcacgctgct aataaagaca tacccaagac tgggtaattg 3840
taaaggaaag aggtttaacg gactcacagt tccacctgcc tggggtggcc tcacaatcat 3900
ggtagaagac aaggaggagc aagtcacatc ttacatggct tcagggaaca gacagcatga 3960
gaaccaagcg aaaggggttt ccccttgtaa aaccatcaag tctagtgaga tttattcact 4020
accacgagaa cagtatgggg ggaaccaccc ccatgattca atcatctccc actgggtccc 4080
tcccacagca cgtgggaatt atgggagtac aattcaagat gagatttggg tggggacaca
gccaaaccct atcggttgcc aacatttaca gtaacagtgt taggtgaaca gttgtccagt 4200
ctcctgtttt gtcggacact gtttctagca ccttccaggc agaatctcat gtatccttca 4260
ctttcgaaat gggtactatt tcatccccac ttttatcaat gagaaactaa agctcgaaga 4320
ggtcaagtaa gttcctggcc aaggtcagct agcaggctct agaggcctcg ttctccttag 4380
aggcagcctt gccagggccc aggcttggca ggctgcaggg caggtgcggg catgcccatg 4440
gtagaggtgg gaccattgag gctcagagag ggtaagtgat gagccctggc gacacagcgg 4500
ggtgggtcca gagtccggcc tgcatcttct ggagctggcc agtggacagg cctttcccgt 4560
tcacagcccc ggggctgctg tgcccaccag ggcggatgtg cctaccgaat cccactcctc 4620
tgtgtgtgtc cctttcaggc cctacatcat tcgaggaatg gcgcccccga cgacgccctg 4680
cagcaccgac gtgtgtgaca gcgactacag cgccagccgc tggaaggcca gcaagtacta 4740
cctggatttg aactcggact cagaccccta tccaccccca cccacgcccc acagccagta 4800
cctgtcggcg gaggacagct gcccgccctc gcccgccacc gagaggagct acttccatct 4860
cttcccgccc cctccgtccc cctgcacgga ctcatcctga cctcggccgg gccactctgg 4920
cttctctgtg cccctgtaaa tagttttaaa tatgaacaaa gaaaaaaata tattttatga 4980
tttaaaaaat aaatataatt gggattttaa aaacatgaga aatgtgaact gtgatggggt 5040
gggcagggct gggagaactt tgtacagtgg agaaatattt ataaacttaa ttttgtaaaa 5100
cagaactgcc attctttcgt gccctgtgtg catttgagtt gtgtgtcccc gtggagggaa 5160
tgccgacccc cggaccacca tgagagtcct cctgcacccg ggcgtccctc tgtccggctc 5220
ctgcagggaa gggctggggc cttgggcaga ggtggatatc tcccctggga tgcatccctg 5280
agctgcaggc cgggccggct ttatgtgcgt gtggcctgtg ccgtcagaaa gggccctggg 5340
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cttcatcacg ctgttgctgt tcgtcttcct cagattctta gtcttttttt tttttttttt 5400 

ttttttgaga cggagtcttt ctctgtcatc caggctggag tgcagtggta caatctcagc 5460 

tcactgcaag ctccgactcc caggttcaag tgagtctcct gcctcagcct cccgagtagc 5520 

tgggactaca ggtgcgcgcc accacacccg cccagctaat ttttgtattt ttagtagaga 5580 

tggggtttca ccatgttggc caggatgatc tcgatctctt gacctcgtga tccgcccacc 5640 

tcggcctccc aaagtgctgg gattataggc atgagccact gtacccagct gactcttagt 5700 

cacttttaag aaggggactg tgccttcatt tttcactggg ccctgcagaa tatatgcctg 5760 

ggctctgggc tcttctgaac ctgtgttggc ttccatctga cctctctgtg ccagcccaag 5820

gctgctgctc ttcctgaggg caaggagccc catgactgcg tgttgactcg ctggatgggg 5880 

ctgctgagcc cactctgcca caccacgtgc ccctggcagg gagggaatcc ctgggtcctc 5940 

acaggaacag tcagcaagcc acacctgacg cctgctgtgg gcccatccct gcggtgctgg 6000 

agaagacaga caaggcctgg tcactgcctc tgcagggtcc ccagtccgtg gaaggagaca 6060 

gtaatctagg cattttcggt ggggaagctg agctgttctc gtgtcctgaa ggccaggcgg 6120 

gaacagccgt cttcagaggg aagggagaaa atgcacatcg catcagtgga gaagggcctg 6180 

acttccctca gcatggtgga gggaggtcag aaaacagtca agcttgttgc tgggtgacag 6240 

tgcatttaat aatcaaaata taggctgggt acggtggctc atgcctgtaa tcccagcact 6300

ttgggaggct gaggcaggtg gatcacttga ggccaggagt ttgagaccgg cctggccaac 6360 

atggcaaaac ctcaactact aaaatacaaa aactagccgg gcgtggtggt gcacgcctgt 6420 

aatcccagct acttgggagg ctgaggcagg agaattgctt gaacctggga ggcggaggct 6480 

gcagtgagcc gagattgtgc cactgcactc cagcctgggc aacagagcaa gactctgtct 6540

caaaaaaaaa aaaaaaaaaa gcaatacaaa atacaaatat cactttcact aaaagaaggg 6600 

atggaagacc caaaacaaac agaaaacaac aaaatggcag gagtaagtcc ccacttatca 6660 

ataataacat tgactgtaaa taggctaagc tctgcaatca aaagagtggg ccaggagcgg 6720 

tggctcacgc ctgtaattcc aacgctttgg gaggctgagg cggatggatc atttgatgtc 6780 

acgagtttta agaccagcct ggccaacaag gtgaaacccc atctgtacta aaaatacaaa 6840 

aattagccag gcggtagtgg cacgcacctg taatcccagc tacttgtgag gctgaggcag 6900 

gagaatcact ggaggctggg aagcggaggt tgctgtgagc caagatggag ccactgcact 6960 
cccacctggg cgacagagtg agatcctgtc ttaagaaaaa aaagagtgga tgaatggatc 7020
aaaaaacaag acccaaccat ctcttgcata caagaaacac actttaccta taaaaacaca 7080
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ctaggccagg tgtggtggct cacacctgta atcccagccc tttgggaggc ctgactggca 7140 
gatcacctga ggccaggagt ttcagaccag cttgaccgac atggcaaaac cccatctctc 7200 
ctaaaaatac aaaaaaacaa aaaaaagaaa aaggctggaa gtagtgatgt gtgcctgtag 7260 
ccccagctac ttgggaggct gaggcaggag aattgcttga atccgggaag tggaggttgc 7320 
agtgagccag gatggtgcca ctgcactcca gcctgggtga cagagcgaga ccctgtcata 7380 
aaaaaaaaaa gaaaagaaaa gaaaaacgag aaaaacaaac acaaaattag tagaagaaaa 7440
gaaataataa agatcagaac aggccaggct catgggcaca gtggctcaac tcctacctgc 7500 
tcaggagttt gagaccagtc tggccaacat ggcaaaaccc catctctcct aaaaatatga 7560 
aaaaaaaaaa ataggctgga tgtggtgatg tgtgtgtgcc tgtagcccca gctacttggg 7620 
aggctgaggt gggagaatca cttgagccca ggaagtggag gctgcagcga gtcatgaatg 7680 
caccctgcac tctagctggg taactggagt gagattctgt ctcaaaaaag caaagaccag 7740
agcagaaata aatgaaatgg aaatgaagga aacaatgcaa aatgatacaa aaagtttttt 7800 
cgaaaagata aacaaaatca acaaaccttt agccagatta agaaaaaaag agagaagacc 7860 
caaataaata aaatccgaga ttaaaaagga gacattacca ctgataccac agaaattcaa 7920 
aggatcatta gaggcaacta tgtgcaacta tatgctaatg aactggaaaa cctagaagaa 7980
ctgggtaaat ttctagacac atacaaccta tcaagattga accatgaaga aatccaaaac 8040
ctgaacaggc cgggcacggt ggcttacgcc tgtaatccca gcactttgga aggcctgaga
tcaggagttc gagaccagcc tggccaacat ggtgaaaccc catctctact gaaaaaatat
aaaaattagc cgggcgtggt ggcgggtgcc tctaatgtca gccactcggg aggctgaggc
aggaaaatca cttgaacctg ggaggcatag gttgcagcga gccgaggttg caccactgca 8280
ctccagcctt ggcgacagag ccagactcca tctcaaaaaa attaaaataa caaaaacctg
aacagaccaa taacaagtaa tgcgatgaaa actgtaataa aatgtttccc aacaaagaaa
gcccaggaac aaatggcttc actgctgaat tttaccaaac attttttttt ttttgagacg 8460
gagtctcgct ctgtcgccca ggctggagtg cagtggtgta acctcggttc gctggtaact 8520
tatgcctctc aggctgcaag tgattttcct gcttcaggcc ccccgagtgg ctggaaatta
gatggtactt gtcaaacaag gcctggctaa atttctatat ttccttcaag tagaagatgt
gcttccaaca aaggttgggt tacggctggc ttctgaaaat cttggatttc aaggctcccc 8700
aaaag

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<210> 11 

<211> 66933 

<212> DNA 

<213> Homo sapiens 



<400> 11 

tataatcaag cgcgttccgt ccagtccggt gggaagattt tcgatatgct tcgtgatctg 60 

ctcaagaacg ttgatcttaa agggttcgag cctgatgtac gtattttgct taccaaatac 120

agcaatagta atggctctca gtccccgtgg atggaggagc aaattcggga tgcctgggga 180

agcatggttc taaaaaatgt tgtacgtgaa acggatgaag ttggtaaagg tcagatccgg 240

atgagaactg tttttgaaca ggccattgat caacgctctt caactggtgc ctggagaaat 300

gctctttcta tttgggaacc tgtctgcaat gaaattttcg atcgtctgat taaaccacgc 360

tgggagatta gataatgaag cgtgcgcctg ttattccaaa acatacgctc aatactcaac 420

cggttgaaga tacttcgtta tcgacaccag ctgccccgat ggtggattcg ttaattgcgc 480

gcgtaggagt aatggctcgc ggtaatgcca ttactttgcc tgtatgtggt cgggatgtga 540

agtttactct tgaagtgctc cggggtgata gtgttgagaa gacctctcgg gtatggtcag 600 

gtaatgaacg tgaccaggag ctgcttactg aggacgcact ggatgatctc atcccttctt 660 

ttctactgac tggtcaacag acaccggcgt tcggtcgaag agtatctggt gtcatagaaa 720 

ttgccgatgg gagtcgccgt cgtaaagctg ctgcacttac cgaaagtgat tatcgtgttc 780 

tggttggcga gctggatgat gagcagatgg ctgcattatc cagattgggt aacgattatc 840 

gcccaacaag tgcttatgaa cgtggtcagc gttatgcaag ccgattgcag aatgaatttg 900 

ctggaaatat ttctgcgctg gctgatgcgg aaaatatttc acgtaagatt attacccgct 960 

gtatcaacac cgccaaattg cctaaatcag ttgttgctct tttttctcac cccggtgaac 1020 
tatctgcccg gtcaggtgat gcacttcaaa aagcctttac agataaagag gaattactta 1080

agcagcaggc atctaacctt catgagcaga aaaaagctgg ggtgatattt gaagctgaag 1140

aagttatcac tcttttaact tctgtgctta aaacgtcatc tgcatcaaga actagtttaa 1200

gctcacgaca tcagtttgct cctggagcga cagtattgta taagggcgat aaaatggtgc 1260

ttaacctgga caggtctcgt gttccaactg agtgtataga gaaaattgag gccattctta 1320

aggaacttga aaagccagca ccctgatgcg accacgtttt agtctacgtt tatctgtctt 1380

tacttaatgt cctttgttac aggccagaaa gcataactgg cctgaatatt ctctctgggc 1440

ccactgttcc acttgtatcg tcggtctgat aatcagactg ggaccacggt cccactcgta 1500 

tcgtcggtct gattattagt ctgggaccac ggtcccactc gtatcgtcgg tctgattatt 1560 

agtctgggac cacggtccca ctcgtatcgt cggtctgata atcagactgg gaccacggtc 1620 

ccactcgtat cgtcggtctg attattagtc tgggaccatg gtcccactcg tatcgtcggt 1680 

ctgattatta gtctgggacc acggtcccac tcgtatcgtc ggtctgatta ttagtctgga 1740 

accacggtcc cactcgtatc gtcggtctga ttattagtct gggaccacgg tcccactcgt 1800

atcgtcggtc tgattattag tctgggacca cgatcccact cgtgttgtcg gtctgattat 1860

cggtctggga ccacggtccc acttgtattg tcgatcagac tatcagcgtg agactacgat 1920

tccatcaatg cctgtcaagg gcaagtattg acatgtcgtc gtaacctgta gaacggagta 1980

acctcggtgt gcggttgtat gcctgctgtg gattgctgct gtgtcctgct tatccacaac 2040 

attttgcgca cggttatgtg gacaaaatac ctggttaccc aggccgtgcc ggcacgttaa 2100 

ccgggctgca tccgatgcaa gtgtgtcgct gtcgacgagc tcgcgagctc ggacatgagg 2160 

ttgccccgta ttcagtgtcg ctgatttgta ttgtctgaag ttgtttttac gttaagttga 2220 

tgcagatcaa ttaatacgat acctgcgtca taattgatta tttgacgtgg tttgatggcc 2280 

tccacgcacg ttgtgatatg tagatgataa tcattatcac tttacgggtc ctttccggtg 2340

atccgacagg ttacggggcg gcgacctcgc gggttttcgc tatttatgaa aattttccgg 2400 

tttaaggcgt ttccgttctt cttcgtcata acttaatgtt tttatttaaa ataccctctg 2460 

aaaagaaagg aaacgacagg tgctgaaagc gagctttttg gcctctgtcg tttcctttct 2520 

ctgtttttgt ccgtggaatg aacaatggaa gtccgagctc atcgctaata acttcgtata 2580 

gcatacatta tacgaagtta tattcgatgc ggccgcaagg ggttcgcgtc agcgggtgtt 2640

ggcgggtgtc ggggctggct taactatgcg gcatcagagc agattgtact gagagtgcac 2700 

catatgcggt gtgaaatacc gcacagatgc gtaaggagaa aataccgcat caggcgccat 2760 

tcgccattca ggctgcgcaa ctgttgggaa gggcgatcgg tgcgggcctc ttcgctatta 2820

cgccagctgg cgaaaggggg atgtgctgca aggcgattaa gttgggtaac gccagggttt 2880

tcccagtcac gacgttgtaa aacgacggcc agtgaattgt aatacgactc actatagggc 2940

gaattcgagc tcggtacccg gggatcctct agagtcgacc tgcaggcatg caagcttctc 3000

ttgtgccggt tgtacgctgt caggtcacac tggtgagtta ggcagggcac agatgcccag 3060

agcagaggga actttccttg gggattcaac acgtgcaagt cttaggggct ggcaaatcct 3120
agaga.yyyyy cttttatttg agaccagaat cacctgagca tcctcctgtc 3180

cccagctgtg tccagcctgt ctgcagggac atcctgagag gaccaggctc tcccctcatc 3240

cacctgccta agtgccactc tgaaccctgt ccacctgtgc cgtggagggg cgtgacctca 3300

agctgctcag ccagcagcag gcttggccct ggggggcagc agagacccag gtggctgtgg 3360

ggtgggtgct tcgtggcgtg gttctgaaac ttcgttggaa gtgtgtggac agtgccttgc 3420

ctgttctctg tgggacccta tttagaaacg aggtctgagt tactgggggt catcactgtg 3480

ttctgatggc ccagctgtgt ggaggccgcg gtgcagcccc atccaaggag ccagggccct 3540

gggtctagcc gtgaccagaa tgcatgcccc ggaggtgttt ctcatctcgc acctgtgttg 3600

cctggtgtgt caagtggtcg tgaaactctg tgttagctct tggtgttcct gaaagtgccc 3660

ccgggtctca ggcctcagaa ccagggtttc ccttcatctc ggtggcctgg gagcatctgg 3720

gcagttgagc aaagagggcg attcacttga aggatgtgtc tggccctgcc taggagcccc 3780

ccggcacggt gctggggcct gaagctgccc tcgggtggtg gagaggaggg agcgatgaag 3840

tggcgtcgag ctgggcagga agggtgagcc cctgcaaggt gggcatgctg gggacgctga 3900

gcagcatggc cagcagctgg gtctgcagcc tggtacccgg cgggacttgt ggttggggct 3960

ggtttgtggc caggagaggg gctggcagga gacaaggggg actgtgaggc agctcccacc 4020

cagcagctga agcccaatgg cctggctgtg tggctctcag ctgcgtgcat aacctctcag 4080

tgcttcagtt ctctcatttg taaaatgagg aaacaaacag tgccagcctc ccagaggtgt 4140

catgaggatg aacgagtgac catgtagcat gggctgggtg cgtgtcacct aacatcacca 4200

gcctttgcaa ggagagccct gggggcctgg ctgagtattt cccttgcccg gcccacccca 4260

ggcctagact tgtgcctgct gcaggccctt gacccctgac cccattgcac ctgtctccac 4320

aggagccgag gaggtgctgc tgctggcccg gcggacggac ctacggagga tctcgctgga 4380

cacgccggac ttcaccgaca tcgtgctgca ggtggacgac atccggcacg ccattgccat 4440

cgactacgac ccgctagagg gctatgtcta ctggacagat gacgaggtgc gggccatccg 4500

cagggcgtac ctggacgggt ctggggcgca gacgctggtc aacaccgaga tcaacgaccc 4560

cgatggcatc gcggtcgact gggtggcccg aaacctctac tggaccgaca cgggcacgga 4620

ccgcatcgag gtgacgcgcc tcaacggcac ctcccgcaag atcctggtgt cggaggacct 4680

ggacgagccc cgagccatcg cactgcaccc cgtgatgggg taagacgggc gggggctggg 4740

gcctggagcc agggccaggc caagcacagg cgagagggag attgacctgg acctgtcatt 4800

ctgggacact gtcttgcatc agaacccgga ggagggcttg ttaaaacacc ggcagctggg 4860

ccccaccccc agagcggtga ttcaggagct ccagggcggg gctgaagact tgggtttcta 4920

acaagcaccc cagtggtccg gtgctgctgc tgggtccatg cgtagaaagc cctggagacc 4980

tggagggagc cctttgttcc cctggcttca gtttcctcat ctgtagaatg gaacggtcca 5040 

tctgggtgat ttccaggatg acagtagtga cagtaagggc agcctctgtg acactgacca 5100 

cagtacaggc caggcctctt tttttctttt tttttttttg agatggagtc tcactctgtc 5160 

gcccaggctg gagtgcagtg gtgtgatctc agctcactac aacctctgcc tcctgggctc 5220 

aagtgattct cctgcctcag cctcctgagt agctgggatt acaggtgcct gccactgtgc 5280 

ttggctaatg tttgtatttt tggtagagat ggggtttcac cgtcttggcc aggctggtcg 5340 

caaactcctg acctcaggtg atccacctgc ctcagcctcc caaagtgctg ggattacagg 5400 

catgagccac cacgcccggt caggccaggc ctcttttgaa cactttgcac accatgggtc 5460 

ttttcatcca ggggggtagg tacagttgta cagttgagga cactgaagcc cagagaggct 5520 

cagggacttg cccagggtca cacagcagga tgtggcaggt gtggggctgg gcctggcagc 5580

gtggctccag ctttccagca tagaaatctg tgaaagcaga tagtttgtcg gtcggtaggg 5640

gagactttct gagacccgcc ccagcggctc agagggtagt agccaggggc cttcctgggg 5700 

gctcataacc cagaacactg aatgggaaaa ccctgatgga ggaggcgcag tggagctgtg 5760 

ggtgccgatg ggaagtccca gaggagctgg gaggtcagta gcggtgctgc cctctgtgga 5820 

gcacttagtg ggcaccaggt gtgtttccag gttcatggcc ctgggacctg aagctcagaa 5880 

ggtgaagtaa cttgcccagg gcacccgtcg ggcagcggcg ggcagaggat ttgtgggctg 5940 

tggagcctgt gctcgtggcc cagccctggg ggttgtgagt gtgctggccg gggagctttt 6000 

cctgcaagtg gactggtgtc taggagccag catgtcaggc agcaggcagc gggagtgcag 6060 

caggcagcgg gagcacagca ggcagagggc ggggctcgag cagccatccg tggaccctgg 6120 

ggcacggagg catgtgggag agggctgctc catggcagtg gctgaagggc tgggttgtgc 6180 

cccgaggagg gtggatgagg gtaagaagtg gggtccccag gggctttagc aagaggaggc 6240

ccaggaactg gttgccagct acagtgaagg gaacacggcc ctgaggtcag gagcttggtc 6300 

aagtcactgt ctacatgggc ctcggtgtcc tcatctgtga aaaaggaagg gatggggaag 6360 

ctgactccaa ggcccctcct agccctggtt tcatgagtct gaggatccca gggacatggg 6420 

cttggcagtc tgacctgtga ggtcgtgggg tccagggagg ggcaccgagc tggaagcggg 6480

aggcagaggg gctggccggc tgggtcagac acagctgaag cagaggctgt gacttggggc 6540

ctcagaacct tcacccctga gctgccaccc caggatctgg gttccctcct tggggggccc 6600 
cagggaacaa gtcacctgtc ctttgcatag gggagccctt cagctatgtg cagaaggttc 6660

tgctctgccc cttcctccct ctaggtgctc agctcctcca gcccactagt cagatgtgag 6720

gctgccccag accctgggca gggtcatttc tgtccactga cctttgggat gggagatgag 6780

ctgagagtcc aagggctggt gtggtgaaac ccgcacaggg tggaagtggg 6840

pat- rrntatc ccaggggagc ccccagggac tctggtcact gggcttgccg ctggcatgct 6900

^ ciy l- v-- iwV^v-»ci , gcacttactg acaccagcat ctactgacac caacatttac aaacaccgac 6960

attgaccgac accgacattt accgacactg acatttacca acactgttta ccaacactga 7020

catctactga cactggcatc taccaacact gacatttacc gacactgaca tttaccaaca 7080

ctatttacca acactgacat ctactgacat tggcatctac caacaccaac atttaccgac 7140

accaacattt accaacactg aaatttaccg acaccgacat ttaccgacac cgtttaccaa 7200

caccgacgtt taccgacacc gacatttacc gacactgata tttaccaaca ctgacatcta 7260

ctgacgctgg catctactga caccgatgcc agcatctacc aacaccgaca tttaccaaca 7320

ctgacattta ctgacactga tatctactga cactggcatc tactgacacc aacatttacc 7380

aacaccagca tctaccaaca ccgacattta ccaacaccag catttaccaa caccgatgtt 7440

taccaacgcc gacgtttacc gacgccagca tctaccaaca ctgacattta ccgacaccga 7500

catttaccga cactgacatt tactgacact gacatctact gatactggca tctaccgaca 7560

ctgatattta ccaacgccag catctactga cactgatgtt taccaacacc gacatttacg 7620

agcaccgaca tttactgaca ccaatattta ctgacatcaa catttagcca tgtgatgggg 7680

gccggcttgg gggcaggcct tgctcttggc actggggatg ctgcagagac cagacagact 7740

catggggtca tggacttctg cttcttctcc agcctcatgt actggacaga ctggggagag 7800

aaccctaaaa tcgagtgtgc caacttggat gggcaggagc ggcgtgtgct ggtcaatgcc 7860

tccctcgggt ggcccaacgg cctggccctg gacctgcagg aggggaagct ctactgggga 7920

gacgccaaga cagacaagat cgaggtgagg ctcctgtgga catgtttgat ccaggaggcc 7980

aggcccagcc accccctgca gccagatgta cgtattggcg aggcaccgat gggtgcctgt 8040

gctctgctat ttggccacat ggaatgcttg agaaaatagt tacaatactt tctgacaaaa 8100

acgccttgag agggtagcgc tatacaacgt cctgtggtta cgtaagatgt tatcattcgg 8160

ccaggtgcct gtagacacag ctacttggag actgaggtgg gaggatcgct ggagtccaag 8220

agtttgaggc cagcccgggc aaaggggaca caggaatcct ctgcactgct tttgccactt 8280

actgtgagat ttaaattatt tcacaataca aaattaagac aaaaagttaa tcacatatcc 8340

actgccctgc ttaagacaga aaacatgggt gttgttgaag ccagaggcag ctgctggcct 8400

gagtttggtg attggttcct aagcagttga aggcagtttt gtttttccat agatgtctgt 8460 

tctccctttg ctgggtgcag cctcgccctg ctgctgtggt cgggtttcag tggcctcgtc 8520 

ccgtggacgc agcctcgccc tgccgctgtg gtcgggtttc agtggcctcg tcccgtggac 8580 

gcagcctcgc cctgctgctg tggtcgggtt tcagtggcct cgtcccgtgg acgcagcctc 8640 

gccctgccgc tgtggtcggg tttcagtggc ctcgtcccgt ggacgcagcc tcgccctgcc 8700 

gctgtggtcg ggtttcagtg gcctcgtccc atgggcgtgc tttggcagct ttttgctcac 8760 

ctgtggagcc tctcttgagc ttttttgttt gttgtttgtt tttgtttgat tttgtttgat 8820 

tgtttgtttt tgttgtcgtt gttgttgccc aggctggagt gcagtggcgc gatctcagct 8880 

cactgaaacc tctgcctcct tgggttcatg ccattctcct gcctcagcct cccacatagc 8940 

tgggattaca agtgcccgcc accacgcctg gctaaatttt gtatttttag tagacagggg 9000 

gtttcaccat gttggtcagg ctggtctgga actcctggtc tcacatgatc cacctgcctc 9060 

ggcctcccaa agtgttggga ttacaggcgt gagccaccgc gcccagcctc tgttgagcat 9120 

attttgaggt tctcttggtg ccagtgatat gtacatgtgt ccccatcgca ccatcgtcac 9180 

ccattgaggt gacattggtg cctctcctcg gggtggatgc ctccctctgt ttccagcaac 9240 

ttctgaagga ttttcctgag ctgcatcagt ccttgttgac gtcaccatcg gggtcacctt 9300 

tgctctcctc agggctccca ggggaggccc gaatcaggca gcttgcaggg cagggcagga 9360 

tggagaacac gagtgtgtgt ctgtgttgca ggatttcaga ccctgcttct gagcgggagg 9420

agtctcagca ccttcagggt ggggaaccca gggatggggg aggctgagtg gacgcccttc 9480 

ccacgaaaac cctaggagct gcaggtgtgg ccatttcctg ctggagctcc ttgtaaatgt 9540

tttgtttttg gcaaggccca tgtttgcggg ccgctgagga tgatttgcct tcacgcatcc 9600 

ccgctacccg tgggagcagg tcagggactc gcgtgtctgt ggcacaccag gcctgtgaca 9660 

ggcgttgttc catgtactgt ctcagcagtg gttttcttga gacagggtct cgctcgctca 9720 

cccaggcgag agtgcagtgg cgcaatcacg gctcgctgta gcctcaatct ccctgggctc 9780

aggtgatcct cctgcctcac cctctgagta gctgggacta cagacacata ccaccacacc 9840 

cagctagttt ttgtgtattt tttgtggggg gagatggggt ttcgctgtgg tgcccaagct 9900

gatctcaaac tcctgaggca caagcgatcc acctgcctcg gcctcccaaa gtgctgggat 9960 

gacaggcatc agccgtcaca cgcagctcaa tgattttatt gtggtaaaat aaacatagca 10020 

caaaattgat gattttaacc attttaaagt gaacagttca ggctgggcgt ggtggcttat 10080 
ccagtacttt gagaggctga ggtgggcaga tcacctgagg tcaggagttt 10140

tggccaacat gatgaaatcc agtctctact aaaaatacaa aaattagccg 10200

caggtgcctg taatcccagc tactcgggag gctgaggcag gagaatcgct 10260

aggtggaggt tgcagtgatc tgagatcatg ccactgcact ccaatctgtg 10320

agactctgtc ttgaaaaata aataaataaa aaaaatttta aaaagtgaac 10380

atttagtatg aggacaatgt ggtgcaggta tctctgctac tatctacttc 10440

tagaacactt tcttctgccc tgaaggaaac cccatgccca ccggcactca cgcccattct 10500

cccctctctc ccagcctctg tcaaccacta atctactttc tgtctctggg ggttcacttc 10560

ttctggacgt tttgtgtgac tggaatcctg caatatgtgg tccctgcgtg tggcttcttt 10620

ccatagcatt gtgttttcca gattcaccca cacattgtcg cacgttatca gaatctcatt 10680

cctgactggg tgcagtgggt taggcctgta atcctaacat tctgggaggc caaggcggga 10740

cgatcacttg aggcaggagt ttgagaccag cctggccagc ctagcaagac cccagctacc 10800

aaaaaatttt aaaagttaac tgaacgtggt ggtggtgggc acttgtggtt cccagctacc 10860

tgggaggctg aggtgggagg atcgcttaag cccaggaggt caaggctgca gtgagctatg 10920

atcgcaccac tgcactccag cctggacaac agagcaagac cctgtctgaa aaaaaaaaca 10980

aaaaaaaaag ttcctttctt tttgtggctg gatgacatcc cattgtatgg ccacagcaca 11040

ttttgtttgt ctgtttatcg ggtggtgggc agtggtttcc accttttgtc tcctgtgaat 11100

aatgctgctg tgaacatttg aattcaagtt tttgtttgaa cacctgttgt gaattatttg 11160

gatatatgtg taggggtagg attgctgagt cctatggtaa tgttaggttt gacttactga 11220

ggaaccatta aactgttttc aacagtggct gcgccgttct gcatccccac cggcagtgtg 11280

tgagggttct gactttacct cctcacaaac gcttcttttc catttaaaaa aatattcagc 11340

caggtgctct ggctcacgcc tgtaatccca gcactttggg aggccgtggc gggcggatca 11400

cctgaggtca ggagttcgag acgagcctgg ccaacatggt gtaaccccat ctctaccaaa 11460

aatataaaaa ttagccgggt gtggcagcgg gcgcctgtaa tcccagctac ttgggaggct 11520

gaggcaggag aatcacttga acccgggagg cagaggttgc agtgagccaa gatcgcgcca 11580

ctacactcca gcctgggtga caagagtgaa actccatcta aaataaaaca aaaataaaaa 11640

taaataaaaa tttattaaaa cattcatcac agccagccta gtgggtgtcc catgtggctt 11700

tgcctcgcat ttccctgata actaggatgc tgagcgtctt gtcccaggct tgccacacct 11760

cagcactttg agatacgtcg cacagtcccc atttgcgaac gagaaatgag gtttagggaa 11820

cagcagctgt gtcatgtcac acagcgagca gggggtctct gagccgtctg accccacagc 11880

cgaccaagct ccaatcctta ccgcctccta gtgttgtgga tgtagcccag ggtgctccca 11940

catttttcag atgagaacac cgaagctcaa aacaggagcg ttttgtccac attggataca 12000

cgatgtctgt ggtttggtcc tgaagtcact ttatatctca gtggtccaga ctggagtagg 12060

acagggggtt ctggggaatg gggaaggtgt ctcaggtgaa aggaaggaat tccagattct 12120

ccatactgtc cttgggaagt tagaagactc agagggtctg gcaaagtcag acaaagcaag 12180

agaaatgcag tcaggaggaa gcggagctgt ccaggaacag gggggtcgca ggagctcacc 12240

cccaggaact acacttgctg gggccttcgt gtcacaatga cgtgagcact gcgtgttgat 12300

tacccacttt tttttttttt ttgaggtgga gtctcgctct cttgcccagt ctggagtgca 12360

gtggcacgat ctcggctcac tgcaagctct gcctcccggg ttcatgccat tctcctgcct 12420

cagcctcccg cgtagctggg actacaggcg cctgccaccg cgcccggcta atttttgtat 12480 

ttttagtaga gatgggattt cactacatta gccaggatgg tctcgatctc ctgacctcat 12540 

gatccgcccg tctcggcctc ccaaagtgct gggattacag gcgtgagcca ccgcgcccgg 12600 

cccgatttcc cactttaaga atctgtctgt acatcctcaa agccctatac acagtgctgg 12660 

gttgctatag ggaatatgag gcttacaggc catggtgctg gacacacaga agggacggag 12720 

gtcaggaggt agaagggcgg agagagggaa caggcggagg tcacatcctt ggctttcaaa 12780 

atgggccagg gagagacacc ctctgagcat ggtaggacag gaaagcaaga ttggaacaca 12840 

ttgagagcaa ccgaggtggc tgggcgtggt ggcttacgcc tgtaatccca acactttgga 12900

aagctgaggt gggtggattg cttgaggcca ggagttcaag accagcctgg ccaacatggt 12960

gagaccccgt ctctactaaa tatacaaaaa ttagccaggc gtgatggtgc atacctgtaa 13020

tcccagctgc ttgggaggct gaggcaggag aattgcttaa acctgggagg cggaggttgc 13080

agtgagccga gatcccgcca ctgcactcca gcctgggcca cagagtgaga ctccatctca 13140 
aaaaaaaaaa aaaaaaaaga taaaaagacc aaccgaggaa ttgaagtggg ggggcgtcac 13200

agtagcagaa gggggatcgt ggagcaggcc accctgtggt catgcactgg aagctcatta 13260

cctgacgatt tggagctcat cactgggggc ctaaggagaa tagatactga aggatgagga 13320

gtgatggcgc ggggcacggg tgtctttggt ggccagaact tggggactgc tggggtgcct 13380

cactgcaggc cttctcagcg ccctttatat gcttacacag gctgtttcta agagggggat 13440

acattgcata agcgttttca gactacctca tcatgggtcc ctttctttac cctctgtggc 13500

cctggtggcg cactctctgg gaaggtgcag gtggatgccc agacccgccc tgccatccac 13560

ctgcacgtcc agagctgact tagcctcgag attgctgctg gcacctcctg ccccgggaca 13620
cctcggatgt gcccgtggag atgctggctc tgtgttttct gctggagttt ggtgcgtctt 13680
ttcctcctgc aagtggccac cgctcttggg tatgtcctca ggcttctgcg agtcatggct 13740
gcttctcagg tccttgccca gcgccaggag caaaccctcc tggcactttg ttcaggggtg 13800
gatgcgccag tgttcctgct gtggaccccc atctcacatg agggtcttgg gcctgcaggc 13860
tcgttcagga aacacccgct gagtacgcag tgtgtgccag ctgtgtccca ggcaatggcg 13920
gggacagtgg ctgctgctgg ggttgtggtg gcttctgggg actctgggga cagctgaggt 13980
gcaaggagcc acggctcctt gaggatgcag ttggactcca ggtggaaggg atggttgggg 14040
gaggtataaa tggggtcagg gaggagacac atttggaaca atgggaacat ttttaagatg 14100
ctatgtcggg aggcaacaag gtggccaacc caggtgctga ggagcccaca ccagccctgg 14160
acgtgttttg ccgctcacct ttgctgggga gtggtgggag agaggattcc gttccacgtg 14220
gtggtgtgcg cagctgggct gtgtggagct gggcgctagg aggaaggtgc tttctgcggg 14280
gctagccggg ctctgccttt gaacacaatc aggctccagg ttttcagcat ccagtgcatg 14340
agaggacttc acgggcagct gtggctgatc ccttgatgaa ttgggagaag aacaaaggtc 14400
tatgaaatga ggtttcatgt agatggcatt agagacgccc acaacagatt tacagagtgg 14460
agcggagacg gcggatgggt ctgggaggcc cctcctgctg gccttgactg tgacagctgt 14520
cctgggaatc agcttccagg ccgccccagc agcctgactg acacacacag gggttttagc 14580
cccatcctgc gaccagctgt tgccatcatc agtgacagct gggagtggcg gtggttccag 14640
ccctgggcac cctccccacc tgctggggcc cacccagggc agtcctgaca cctacaggtt 14700
gcttggagcc gcatccgagt cctgccccac cacgtgtgaa gcccgagtgg tcgtgggctg 14760
aggtcccctg attgcatccc cacttccctt ctgcttcaca tagctgcctc ttctcaccgt 14820
ttttccagcc tcctgggcta ggaattccag tgttgtgctg gctttgcccc aggacacctc 14880
cttagccctc ttcctgagtc tagagccccg ggggttggaa gttctggccc ctgggacacc 14940
tgcagccaca ctcagcttct cctgtgagcc tccagcatgt cccctcagga ccaagccctc 15000
acgttcttgc ctccccgccc acctgggctc agccagggga aggcctggct gggagcgtct 15060
cccctctgcc ctgcccttct cccctctacc ctgcccttct ctcctctgcc ccgccatggc 15120
ttttatatcc tgtgccacaa gacatggctg tgtgtgaaag tggcagggtc tggcatctct 15180
gtgggtctct gaggcccacg ctccagtgcc actcttccca cccgctggcc gtgccctcat 15240
gctggaggga cagcccagcc ctctcccgaa ccccagcccc atgtgcccag ctgcccccgg 15300
ccctctcccc tggaagccgg ggtcactcca gccgtatgcc atggtgggga catcctgctt 15360
ccttggcctt ccagggaagg tcctctttcc aaatggcgac acctggtccc tgcctggagg 15420
ctggaagctg tggcccttgt atgcccctcc agggtctgtg cgctcggttg gcccgagttc 15480
ccatcaccgt catcatcacc atcatcattg tcatttcgct tgtctgtgag ccggcctggt 15540
ctcccagagc agagaccctc tgaggtccag cctgagttgg ggtctccgtg ctgacccctg 15600
acggggactc aggacgtacc aggtctgggt caggagtgac ccccaaacct cgtgcccttt 15660
gacaggcacc cctgactttt gctaagtggg tggaggtgac atcacttaca gcgggagtga 15720
tgggacaggg tctgttggct gcactgtgct cccagggatc tggggagagg ctatatccct 15780
gggctttggc actgcagagc tgtgtgtgtt tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg 15840
tgtgtgtgtg tgtgtgtgtg tttgcgtgcg cgcacatgtg tataagatct ttttttatta 15900
catgaagcaa gataactgtt gctgtttcct tttgggtttt gtgttcaaca gagtggggta 15960
cttcttccct cagacaacag aactctcccc tttaaacacg tgctgtcaga gggtgggtct 16020
tgggctcatg tctgtttgca cagccgagtc agaggaaaca cagggttctt cataaaaaca 16080
ctgcacagca ggcgactgtc cagagtcagc ctgcaggacg gcagcagccc tgcccctcag 16140
agcacagcta gggtgggctg ctttgggatc tcccgtcatt ccctcccagc tggcagccgg 16200
cggccggccc attccttggt gtgctggtca ggggggcgtg cgcctgctct gctcaccctg 16260
ggaatgggac agaagctggc agctcggaga ggacagggct ggacccttgg gtggcctctg 16320
gctggaccat ctcattgtcc tcagacacag cctctcgggt ctagtttcat ttcctgaaaa 16380
acaagtgcac agaactagag caggagtcga gagctacggc ccccgggcca gatccagccc 16440
tgccacctgt tttcacacca tgctcaagct gagtgggttt tacatttttt aattacttga 16500
aaaaaaaaaa gccaaaggag gtttcatgac ccatgaaaat tatatggaat tcaaaaaaaa 16560
aaaattatat ggaattcaaa tttcagtgtc cataaataat ttcttgagac agggtctcgc 16620
tctgtcaccc aggctggagt gcagtgctat ggcatggctc gctgtaccct tgacctccca 16680
ggctcaagcg atcctcctgt ctcagcctcc tgagtagctg ggactacggg tgtgtgccac 16740
caagcccggc taattttttt ttaattttag taaagacagg gtctttctat gttgcccagg 16800
cttttctgga actccatctt ggcctcccaa agtgctggga ttacaggctc gagccacgga 16860
gcccagcctg tttttgtttt ttcactgata aagttttgcc gggtgtggta gtgtgtgcct 16920
ctagcgattt gggaggctga ggtgggagga tcgcttaagc ccaggagttt gaggctgggc 16980
tcaagtgatc aggaggtgaa ctatgatcat gtcattgcat tccagcctgg gtgacagagc 17040
aagaacctat ctcttaaaaa tatatattta aaaagtattg ggtgtggtgg ctcacgcctg 17100
tggtcccagc tacttaggca tctgaggtgg gaggatggct tgagcccagg agtttgaggt 17160
tgcagcgagc caagatcgtg tcactacact ctagcctggg tgacagagcc cagaccctgc 17220
ctctttaaaa aaaaaaacca aaaaacatgt attggaacac agccatgcct gttcagtcac 17280
gtgctctcca tgctgctttc tgctccagag acccttatgg cctgaaagct gaaaatattt 17340
tctatccttt acaaaaaagt ttgctgacct ctgtcctgga aaattcatct cccaagttct 17400
cttccggcac tggcgttcct gggtgtccta aatttggccc ctgttatttc tgaactctgt 17460
tttggctctg ttccctccca ggagccagga caggcacgtt ctctgcatct tgtcccctga 17520
cgcccagagg cttggctcgg ctcaggcatt cttggaaata tctggctcca ggaaaggcag 17580
aggcctcctg agtcagccca gagggaacct gccccaggtc tgggggaggc ctgacccagc 17640
agagtggctt ttgccgatgg gttgggccgg tcaagatgtg ctgaaagttg tcctcagaag 17700
gccactttgg gattccttcc tccagtatta gagcaactga gagctgctca ttgcaagcct 17760
gatgttttcc cagttggccg ggtccaccgg gtgccctggg attctgggat ctgggtggaa 17820
agtagggggc ttgggggagt gtcctgggtt ctggaatcca ggtggcaagt ggtgaggttc 17880
agggagtggc ttctgagcca ccataggggt ctctgtggga ggctctgccc atccaggaga 17940
ttccgcaggc cctgccggcc cagagccagc gtcttgcgct tgccgaggct acagccagcc 18000
ccagccgggt ggaacagccc gtcgcctcct ctcactttgt tttggggcca cctgggagtg 18060
tggagcaagg gtagagaggg aggaagtggc tgccggccgc tgcccagcac ccttgtttgc 18120
cttgggccct ctgtgggctc ctttttattg ctcttcaatg aagccaggga aatggacttc 18180
cttgcctcac ttcagttcaa catgtctgga agtttggtat taaaattaag aaagtgtgga 18240
aatagagcaa gaagagaaaa atctctccaa gagataatag tgacctctga gctgggcgcg 18300
gtggctcacg cctgtaaatc ccagtacttt gggaggctga ggcgggcaga tcacctgagg 18360
tcgggagttt gtgaccggcc tgaccaagat ggagaaaccc cgtctctact aaaaataaat 18420
aaataaataa ataaataaat acaaaattag ccaggcatgg tggcgcctgc ctataatccc 18480
agctaaggca ggagaatcgc ttgaacctgg gaggcaaagg ttgcagtgag ccaagatcac 18540
gccattgcac tctagtctgg gcaacaagag tgaaactccg tctcaaaaaa aataaataaa 18600
taaaaaataa aaatagtgac ctctggccag gtgtggcagc tcatacccgt aatcccagca 18660
ctttggaagg aaggccgaga tgggcagatt gctttagcac aggagtttga gaccagcctg 18720
gccaacatgg tggaacccca tctctacaaa aatagaataa aatttaagag gtaatagtga 18780
ccttttggta gatcgaaacc tggattgctt tctttttcta aatgctgatt cttttctttg 18840
tggtgtttgt gttctgtgcc gatgtccctc ccccagccct gttattgtga gtggaagaag 18900
gggaaagggt tcgcccgcta ctgtgagccc ctcctctcac gctgggtgtc cttggagaag 18960
cctgcacttc ttcattgtac gccagggctg ggtccctccc tggagtggtt ctgtgctgct 19020
gggatggggc caacccctca gatgttttct gagtgtcaca cacaggtgtg tgcattcatg 19080
gcctttgcgt gtcttcctgt tgtggaggca aaaatgtgaa gaaccctaga tgattttggg 19140
accagggctc catcacctgc tgttcattgc acaccggagc atccaggcat gggtggagag 19200
ctcagacttc caggcacggt cgcaggggct ggtctaacca tgttcccgcc cgcctgctcg 19260
tcagaaccgc ctgttgggag ctgttatcat gataccatac ctgggccctg ggctatccga 19320
ttctgactta attgctccag gttggggcca ggccgttgtt tgctgttttg ttgtttcttc 19380
tgtgacgtta gccactgggc taatctgagc ccctcagtta caggtggaga aactgagacc 19440
catgggggtg caaggacttg ccgaggaccc agagcccctt gggggcagag ctgaggcggg 19500
gcctggcttt gggtcccaga gcttccagtc cccttcccgc tctcctaaca gctttttttt 19560
ttgagacaag atctcaccct gtcacccagg ctggagtgca atggcatgat ctcggctcac 19620
tgcaatcttc gctagctgcg ttccagcgat tctcctgcct cagcctcccg agcagctggg 19680
attacaggtg tgtgccgcca tgcccagctc gttttttttt gtacttttag tagagatagg 19740
gtttcaccat gttggccagg ctgatctcga actcctgacc tcaaatgatc cgcctgcctc 19800
ggcctcccaa agtgctagga ttacaggctg ggatcacact gtgcctggcc ctagcagctt 19860
tgtcctgtgc catccaacaa cagatgaccg aagtctttgt ttcttaacat gcattccatc 19920
tgccttacag ttttgccacc tgcaaaacag aggacttgtc gcttttctgg taagctggaa 19980
atgtaatctg gtagcaggag gcctgtggaa gcttgccttt aatggccttg tgtctctttc 20040
atcctgtcct gagagccgga gaacttggat gttgcaccta actcaacctt cctgttaaca 20100
tacagttctg caggctcatg gatcatcaga accacgtcct atctcacgcg gctgtatgct 20160
tccgttggtt caggtgtttt taccttgaca gtattttctc ctcggtggct tttgcggtgg 20220
ttgcttttaa tcagcattga ctcttcaaga aaaatattta gctgctacat ctcagaggag 20280
acagggtgga aagcatctga gacctgcagg ctcagactta gaaccagaag tgccctcaga 20340
gttcatccgg ccctgaccca gcgggaaatg agttcacaga gaagcgggag aactttgccc 20400
caggccctgc cgttgctcat aactgcccca ggtccttaca tttgctccag gtcctgcccc 20460
aggccctgca gttgctcata actgccccag gtccttatat ttgctccagg tcctgcccca 20520
ttactctatg tggtgggtgt gatctggagc cctccgccca ttgctgcacc 20580

attgctaatt gatcccagga ctccttcctg cggagcacgc cctggttctc 20640

caggcagccg ctgcctgtca gcctgcagtg gttcgggaga ggacacctgc ttgcctggtc 20700

tgttccaaat cttgcttctc atcccagcac aggtaggggg tgctatggga aagggatcct 20760

cagttggccc tgtcactgct ctatcagctg gggacgtggc atcctagtga aaacatcatg 20820

gccgggcgcg gtggctcacg cctggaatcc cagcactttg ggaggctgag gagggtggat 20880

cacttgaggt cagaagttcg agaccagcct ggtcaacatg gtgaaaccca tctctactaa 20940

aaatacaaaa attcgccagg tgtggtggcg ggtacctgta atccgagcta ctcgggaggc 21000

tgaggcagga gaatcgcttg aacctgggag gtggagcttg cagtgagccg agatcttgcc 21060

actgcactcc agcctgggca acagagtgag acgctgtctc aaaatctcaa acaaacaaac 21120

aaacaaaaaa caaacaaaca aagcgtcatt tatccagcac ccctggggaa ccatgctacc 21180

tggtgtttta tggtacctgg caaggtgcag gtgaagttgc tgctcttggg cattgaaccc 21240

gtcttgtttg gggcagctca ggccccaggc agggtccggg ttggctctcg ttggtgtggc 21300

cctggcccat ccagacctat atttctgccg tcctgcaggt gatcaatgtt gatgggacga 21360

agaggcggac cctcctggag gacaagctcc cgcacatttt cgggttcacg ctgctggggg 21420

acttcatcta ctggactgac tggcagcgcc gcagcatcga gcgggtgcac aaggtcaagg 21480

ccagccggga cgtcatcatt gaccagctgc ccgacctgat ggggctcaaa gctgtgaatg 21540

tggccaaggt cgtcggtgag tccggggggt cccaagccat ggctcagcca tgcagacttg 21600

catgaggagg aagtgacggg tccatgcctg ggcataagtg ttgagctcag gtgccccgac 21660

ctggggaagg gcaggacagg aaaggtgaca gtatctggcc aaggacagat gggaagggac 21720

caagggagct gattagggag tggttatgga ctaggaatgt cggtaacaat ggttagaaag 21780

tgactaacat ttgttgagca cctgctgtgt gcccggccct ggccgggagc cttcgtgccc 21840

acagtgaccc cgtctgcaaa tgtagttcct tgccctactc gcactgggga gcaggacgca 21900

gagccgtgca tctcacaggt gccaagctca ggactccctc ctgggtctgc ctgggctggg 21960

ctgtgcttgt tgcccctgtg gcccacgcat gtgcaccttc cacctgaaag ccaggatctt 22020

caggacgctc cccgaggagg tcgttgtctg gcacaatgat ttgtctcttc ctgaaaaggt 22080

gacagagtta cactggagag agcagcatcc aggtgcggca gggacaggcc tggggctcgc 22140

gggcagggac tctgtgtcct gccggggtcc cacactgcac ctgcttgtca gaggcactca 22200

gtcaatcttt gctgatgaag gatgagagga cagaggacgt gatgcttgct gctgcattgc 22260

ctgcagtcct gggtgagatg cccgggttga ctctgctgcc cgtcgggtgg atgtgatgtc 22320

agatccccgg ctttaaaata cgagggagct gggaattgag ggagcaggtt ggggcagaaa 22380 

gcacagcccc gtggaagcct ggagctgagg cagtgtgggc gacccctgga gcagtgagtg 22440 

cttccttcat ggccttcatc gcaccctgca gtcctcatgt aggggatgcc atccatgaat 22500 

ttagttttcc cagcctcctt taaaaacgcg ttcatgctgg ggccggggca gtgcagtggc 22560 

tcacatctga aatcccacca ctttgggagg ccgaggcggg tggatcatga ggtcaggaga 22620 

tcgagaccat cctggctaac aaggtgaaac cccgtctcta ctaaaaatac aaaaaattag 22680 

ccgggtgcgg tggcgggcgc ctgtagtccc agctactcgg gaggctgagg caggagaatg 22740 

gcgtgaaccc gggaagcgga gcttgcagtg agccgagatt gcgccactgc agtccgcagt 22800 

ccggcctggg cgacagagcg agactccgtc tcaaaaaaaa aaaaaaaagt acaaaaaaaa 22860 

aaaaattagt ctgggtgtgg tatcacgcgc ctataatctc actactcgag aggctgaggc 22920 

ggagaattgc ttgaacccag gaggtagagg ttgtagtgag cccgtatcgt accactgccc 22980 

tccacctggg caatagagcg agactctgtc tcaaaaagaa aaaaaaaaaa agaacattta 23040 

tgccaggtgt ggtggctcat gcctgaaatc ccagaacttt ggaagactga ggcaggagga 23100 

tcacttgagc ccagaaattt gagagtgtct tccctgggca acatagagag acctcatctc 23160 

taccagaaaa aaaaaaatta gcccggcatg gtggcatatc cctgtggtcc cagctactta 23220 

gggggctgac gtggcaggat cacctgagtc tggaggcaga ggttgaagtg agctgagatc 23280 

atgccactgc actccagcct gggtgacaga cagagaccct gtctcaaaaa aaaaaaaaaa 23340

aaaaagcatt tactatccac catggaaggt gagactgacc tgtgagtgat tgttcaaaga 23400 

acaaaaaata aaccccagag ataagacaaa agggtgcctc catgggggtg tgatttaaag 23460 

ctgagaaatt gggcttcttc cccctcccct ctcaccccgt ggtttgctaa aggagatggg 23520 

aaaaaggatt ctttttttgg ctgaaatatt taacactaaa ttaaagccaa ttttaacagc 23580 

actttggttg atgagtgaaa ttaacagact ggccaaaaat aaacgaacgg tctgtactat 23640 
gtgaaaaaga ggcagctttg gccatgctgg gccaatgtga gttttcaggg ttgctgggaa 23700 
tgtctgtgaa tcggaggaag ggcctagctg ggactctcag gagccaaggc cctgaggggc 23760 
aacttgcctg gtccctgccc tgaggcgttc actgctttct tcctgggcca gatcacaggc 23820 
ccggaggctg gaccactggg ctggcactct tgccgagctg ctccctgact tcctgaccat 23880 
gctcctttca gcagccttgc tgcactttag tttccttgaa tgaaaaatgg ggatgagaat 23940
agctcctacc tccaaggtga atggagtgag ttcggacagg tgactccctg ggaccagtgc 24000 
ctggcgcctg acaaggtcca gtcagagccc gcactgctgt tactgatacc cttggctgta 24060

ccaggggaga acttggttgc cattgccagg tgttctccca ccacccccac tactgtccct 24120 

gtttgatgtg tggcgggaat aaagctgtgc acattggagc ttttggcaca tcctggcttt 24180 

caggtgaaag gtgcgtgtgt gtttgagggt ttagcctggc caacccagcc atgaggtcgg 24240 

acctgacctg ggggtgagtc ctgagctcgg cacccctgag ctgtgtggct cacggcagca 24300 

ttcattgtgt ggcttggccg cacccctttc cctgctgggc tgttgatgtt tagactggag 24360 

cctctgtgtt cgcttccagg aaccaacccg tgtgcggaca ggaacggggg gtgcagccac 24420 

ctgtgcttct tcacacccca cgcaacccgg tgtggctgcc ccatcggcct ggagctgctg 24480 

agtgacatga agacctgcat cgtgcctgag gccttcttgg tcttcaccag cagagccgcc 24540 

atccacagga tctccctcga gaccaataac aacgacgtgg ccatcccgct cacgggcgtc 24600 

aaggaggcct cagccctgga ctttgatgtg tccaacaacc acatctactg gacagacgtc 24660 

agcctgaagg tagcgtgggc cagaacgtgc acacaggcag cctttatggg aaaaccttgc 24720 

ctctgttcct gcctcaaagg cttcagacac ttttcttaaa gcactatcgt atttattgta 24780 

acgcagttca agctaatcaa atatgagcaa gcctatttaa aaaaaaaaaa gatgattata 24840 

atgagcaagt ccggtagaca cacataaggg cttttgtgaa atgcttgtgt gaatgtgaaa 24900 

tatttgttgt ccgttgagct tgacttcaga caccccaccc actcccttgt cggtgcccgt 24960

ttgctcagca gactctttct tcatttatag tgcaaatgta aacatccagg acaaatacag 25020 

gaagactttt tttttttttt tttgagacag agtcttactc tgttgcccag gctggagtac 25080 

cgtagcgtga gctcagctca ctgcaacctc cgcctcccag gttcaagcga ttcttctgcc 25140

tcagcctcct gagtagctgg gactacagac atgcaccacc acacccagct aatttttttt 25200 

atatttttag tagagacagg gtttcatcat gttggccagg ctggtcttga actcctgacc 25260 

tcaggtgatc tgcccgcctc ggcctcccaa agtgctgaga taacaggtgt gagccaccgt 25320 

tcccggcata ggaaaacttt ttgccttcta aagaagagtt tagcaaacta gtctgtgggc 25380 

tggccttctg attctgtaaa gaaagtttga ttggtggctg ggtgcggtgg ctcacacctg 25440 

taatcccatc actttgggag gccgacgtgg gcatatcacc tgatgtcggg acttcgagac 25500 
cagcctcacc aacgtggaga aaccccgtct ctactaaaaa tacaaaaaaa aaattaaccg 25560 
ggcatggcgg cgcctgcctg taatcgcagc tactcaggag gctgaagcag gagaattgct 25620
tgaacctggg aggcggaggt tgtggtgagc tgagatggca ccattgcact ccagcctggg 25680 
caacaaaagt gaaactccgt ctcagaaaaa aaaaagtttg attggtgtaa ccaaagcgca 25740 
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tttgtttatg gattgtctgt ggcagctttt gttctgccga gatgagttgt gacagatctg 25800
tatgggctct aaagcctaaa acatgtgcca tccgcccctt tacagaaaaa gtgtgctgac 25860
ctctgttcta aagtattgga caactacaat gtttgctcat ttattattct atgatttgtt 25920
ttctgctttt tgttgttgtt gttgttgttg agatagggtt tccctctgtc actcaggctg 25980
gagtgcagtg gtgtaatctc agctcactgc agcctcgacc tcctgggctc tagtgatcct 26040
ctcatctcag cctccctagt agctgggact acaggcacac accaccactc ctggctgatt 26100
tttttttttt tttttttttt ttgtggagac agggtttccg catgttgccc aggctggttt 26160
raaartccta ggctcaaaca cccacctcag cctcccaaag tgctgggatt acaggcgtga 26220
cr c* r 1 ar c a t crc ccagcctatt ctactgtttg tattacatag ctttaaaaga ttttttatga 26280
ctttaagtca caagggttct ttgtagaaaa aaatatatat ataggaaagt ataaaaagaa 26340
agtaaaaatt gtccataacc tctccagcca gagacgaccg ttgctgacac ctcagcatat 26400
tgcctttaag tcttttttct ctaagatagc atttctcttc atcacagtca tatgctacgc 26460
agaattctgt atcctgattt tttcacttga cattacaaca ggtatttgat ggcgctgtga 26520
caaactcttt ggcacaatct tttaaatgta tgaaatactc cactgcacag atgtttgctt 26580
ttaggcttaa ctgttctttt attttgcgtg tgctggttac agccgggcac agtggctcat 26640
gcctgtaatc acaacacttt gagagggtga ggcaggagga tcacttgagc ccagaagttt 26700
gagaccggcc tgggcaacat agtgagaccc catctctaca aaaaactttt ttaataagtc 26760
gggcgtagtg gtgcatagct gtagtcccag ccaccaagga ggctgagttg ggaggattgc 26820
ttgagcccca ggaggttgat gctgcagtga cctgagatta ctccactgta ctccaacctg 26880
agcgacagag caagacttgt ctggggaaaa aaaaaaaaaa aatatatata tatatatata 26940
tatatataca tatatacata cacgcacaca cacataatat aaaaatatat atttataaat 27000
atataatata taatataaaa atatatattt ataaataaaa tttataaatt atatttataa 27060
gtaaatatat aatatataat ataaaaatat atattatata atatataata aaatatataa 27120
tataaaaata tatatttata aataatatat aatacatact tataagtata tatttaaaat 27180
atatgtaatg tatatttttt aatgtatgat atataatata catttataaa tacacattta 27240
tattatttta tataaaatat atataaaatc tccaagttgc tttttccaaa aaggtgtctt 27300
gctgcatttc aaacattcat ttaaaaactt gaatgctggt gatctggtcc agaatgtgtt 27360
cagtagctgc tgccagtggc caagcatctc gggagatgtc tacaaaacac gctggttctg 27420
gcctggcgtg gtggctcacg cctgtaatct cagcactttg ggaggctgag gcaggtggat 27480
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caactgaggt ctggatttcg agaccagcct tgccagcttg gtgaaacccc atctctacta 27540 

agaatacaaa aaaattagcc aggcgtggtg gcatgtgcct gtaatcccac ctacttggga 27600 

ggctaaggct ggagaatcgc ttgaacccag ggggcagagg ttgcagtgag ccgagatcgc 27660

accattgcac tccaggctgg gcaagaagag cgaaactccg tctcaaaaaa aaaaaaaaag 27720

atgctggttc ctaaaatgtg gcccttttcc tcctcacctg ctgccagacc atcagccgcg 27780

ccttcatgaa cgggagctcg gtggagcacg tggtggagtt tggccttgac taccccgagg 27840 

gcatggccgt tgactggatg ggcaagaacc tctactgggc cgacactggg accaacagaa 27900 

tcgaagtggc gcggctggac gggcagttcc ggcaagtcct cgtgtggagg gacttggaca 27960 

acccgaggtc gctggccctg gatcccacca aggggtaagt gtttgcctgt cccgtgcgtc 28020

cttgtgttca cctcgtatga gacagtgcgg gggtgccaac tgggcaaggt ggcaggctgt 28080 

ccgtgtggcc ctcagtgatt agagctgtac tgatgtcatt agccttgatg gtggccagga 28140 

ctggtagggc cctcagaggt catggagttc cttcgtggag cgggtgctga ggctgtatca 28200 

ggcacagtgc tggctgcttt cacctgggcc gtctcaccga agtgtccatg gagcctgcgt 28260

agggtgggta tctgtgtcga ttttacagat gcagaaacag gctcagagaa accgagtgac 28320

ttccctaagg tcacataccc agttagagca gagctgggcc aggaagtgct gtctcaggct 28380 

cctgaccagg tctccttgct ttgcactctt gccaaaacca tgatccagaa ctgactttga 28440

ggtccccgga cctcaggctc ctccgaaatg gcctcttgga ggctgctgag ccacagctta 28500

ggacccacct cgagaggcaa atgtgctttg agctgccagg cgtcctgggg gccctgcctt 28560

gggcacgggg ttcagacagg ccccagatgt gtggggcgtc tttctggact tgagttttct 28620 

tttctgtgtg gtggacacag tgctcacccc ttaaagcacc tgtgatgtgt gcagcagccc 28680 

aatccctgcc tgtcgcctgt tctgctaggg aaggaaggaa gacttcagga tggcaggaca 28740

acagaaagag gtccaggttt tagagcaagg gcaggtcaaa cttagaaaat tctggaatga 28800

ggatgtgcat ttcctcttct ggatctgcta aaagaagagg gaaggagggg ctgctggggg 28860

aggagcccag agccgagttt acatccggat cccgcaaggc ctcccctgcc ctgaggtctt 28920 

gttttgtgat gtgcttgtgt ccatcctggt ttctgccgtg tccccaacat ccggccaagc 28980 

ttaggtggat gttccagcac acactcaccc tgtctgtgca cctgtttttg tgtccgtaag 29040 

tgggtattta ctcaccttac gagtgagcca ctgtgggaat tcagggaggt ggcgcagtga 29100 

ccacccctgg agggatatgt gtgtggcagg ggtcgagggt ctcgcccttc cctgcttcct 29160 

gcgcgtggct ttctccagga cggggagggc tgagctgaag aggtggggac agttgcgtcc 29220

ccccgccacc cactgtcctg cggtgagagc agactcactg agcctgccct tctcccttgt 29280
gccttccagc tacatctact ggaccgagtg gggcggcaag ccgaggatcg tgcgggcctt 29340
catggacggg accaactgca tgacgctggt ggacaaggtg ggccgggcca acgacctcac 29400
cattgactac gctgaccagc gcctctactg gaccgacctg gacaccaaca tgatcgagtc 29460
gtccaacatg ctgggtgagg gccgggctgg ggccttctgg tcatggaggg cggggcagcc 29520
gggcgttggc cacctcccag cctcgccgca cgtaccctgt ggcctgcaag ttccccaacc 29580
tggcaggagc tgtggccaca cccacgactg cccagcagcc tcaccctctg ctgtgggagt 29640
tgtccccgtc cacccctggg tgcctttgct gcagttatgt cgggagaggc tctggtgaca 29700
gctgtttcct gtgcacctgc tgggcactag gtcccagcta atccctgtgc caggactcta 29760
atttcaccct aacacacatg gtggttttca ttgctgggga agctgaggcc tgagcacatg 29820
acttgcctta ggtcacatag ctggtgagtt caggatcccc cagagatacc agggccagca 29880
ctcgatcccc acccagccct gaaccccacc atgtgctggg attgtgctgg gagtgtccac 29940 
acgcctggga ccccagggct ggtgctctca tctccttttt ccagatcatg agaatgaggc 30000
tcagggaagt ttgaaaaaaa cctatcccaa gtcacacagc aacaggagca ggatttgaac 30060
ccagaaaagg ggaccgcaca ctctgttctg ctagagtagt tagctgtcct gggtgatatg 30120
gcaggtgaca ggggcaactg tgcttaacaa aggaaccccc atcccccctg ccaagttggg 30180
agactagaag gtcaggggca gaagctctga agggccaggt gcagtggctg acacctctaa 30240 
tcccagcact ttgtgaggbc aaggcgggca gatgatttga gcccaggagt tcaagatcag 30300 
cctgggtaat gtagtgagac gccatctcta caaaaaaatt ttttaaaaat tagctgggca 30360 
tggtggttca tgcctgtagt ccaagctact tgggaggctc aggtgggagg attgcttgag 30420 
cccaggaggt tgaggttgtg gtgagctgtg atcatgccac tgcactccag cctgggcaat 30480
agagtgagac cgtctccaaa aaaaaaaaaa gaagaagaaa aagaagctct gaggctccaa 30540
gtccccaggc accccttggc ttgagggcag acaagggagg agagggtcac ctgggcagcc 30600 
ctgacttttg tcccctggca aagggacctt cagtgacctt ggccctagga gagcctctga 30660 
gcacgtcagc catgtcgaac cgctcaggaa gggcagcaag aatttggctt ctgacctctg 30720
cctctcctac tcgccatctg cactgggtgt ggttgtgccc attttacaga tgaggaggct 30780
ggggcatcga ccagctgaat gccttgtccc aggtactgcg taggcagagc tggcagttga 30840
accccgtgtc ctggttgtcg ctgggggtgg gctgcaccct gacttgtgag gccagtagca 30900
aggtttgcac gtgacttcgt gaccgtcacc cagctctgca gcacatcccg tgacccagct 30960
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catccaggcc gcatgcaaac ctgttgccag gcgagaaacc agtcaccgca cagctgtggt 31020 

tgcctgaaat gattaagctc attaatcacc ccggagtgag gacagactca gatgaaaacc 31080 

agcaaaagcc ctggaaactc atgtgaccct gccaatgagg gcggccatgt gcattgcagc 31140 

ctggccgtca ctcctcggta cgtgttttgg acttaaacgc tccggatgtt tactgagtgc 31200 

ttgattaata acatggaagg cctggtctca ttgctgtggg agtgaaggat gcacagccag 31260 

gcctgacatg atgagaacaa gaacctggag tctcgctgcc tgggtggtaa tcctggccct 31320 

gccacttagc aactgtgtga ctgtagccag gtcacttaat tttgctagat cctgcctgcg 31380 

cttcagtgga tcttgctggt tttccaaggt ggccaaacac tttaaggcat tcatgtggtc 31440 

gctaggctgc agggttgaac cctggctcac cccgcagggc gccgtgtgct ctgtggcctg 31500 

gctgtgcctt tgctgacacc gtgcccgtgt gtgttcatgc aggtcaggag cgggtcgtga 31560 

ttgccgacga tctcccgcac ccgttcggtc tgacgcagta cagcgattat atctactgga 31620 

cagactggaa tctgcacagc attgagcggg ccgacaagac tagcggccgg aaccgcaccc 31680

tcatccaggg ccacctggac ttcgtgatgg acatcctggt gttccactcc tcccgccagg 31740 

atggcctcaa tgactgtatg cacaacaacg ggcagtgtgg gcagctgtgc cttgccatcc 31800 

ccggcggcca ccgctgcggc tgcgcctcac actacaccct ggaccccagc agccgcaact 31860 

gcagccgtaa gtgcctcatg gtcccccgca cctcactccc tcgttagatc aggctggttc 31920 

tgggagctga cgctgaaagg agcttctcat ctggggttcc tgggtgtaca tagatggttg 31980 

ggtaggttgt gcactgcaca agctgcatga tgctacctgg gggtccaggt ccaggctgga 32040 

tggacttgtt gcttcatcag gacatagata aatggccaaa actcctcagc tggaaggtcc 32100 

tgggcaggat ctttgggtgt gaaaaccagt cacaggggaa gggtgcttgc tcatactgcc 32160 

agcacagtgc tgagtgcttt ccatagcgct cgtttactcc tcaagcctgg agggtgggga 32220 

gtagcatggt cccatttcac gtacaaggaa cccgatgcac agagaggtgt ggcaacccat 32280 

ccaaggccat acaactgggg tgggttgagc cggggttgac tgtggcaggc tggctcaaga 32340 

gtccctgctc ctgaaccctt gccaggcagc ctggcatcag ctcggggaat ttttgccctg 32400 
acccttggaa gcaagtgggc ctctttgttc tcatgtcagt gatgagaaga gtgactttcc 32460
tatggcccct ctggagtaca ggtgtttcct gttggcgggc tcttccccca tgacatcagc 32520

agcgagctgg ttatgattcc ctacgcagaa cttgatagtt tataaagctc tttgtcatcc 32580 
aggccccgtt ggagtctcac gcagacctgg tcgcaggcgg ggctggtctt gcctgtccca 32640
gctgcatgga tggggaactt gaggcttgca aaggttaagg ggctgttcga ggcccaggct 32700
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ggcaggagat gggcctgggc cagagtctgg gacttcccat gcctgggctg tctttggtcc 32760 

tgttgctcac catccctccc tggggccatg accttagaga gccaaatgga ggtgcaggta 32820 

acccacggca aggaggggtt gccatgactc agagtccccg tcctgtggcc ggcagtacct 32880 
ggtgcaacga cttggatttc agaccagcca ctgtagcccg ctgacggtgc gctcgaagtg 32940
ccacagcttc tgaagccagg caggactcag gccaggagac tctgttagct gttgagaggg 33000

agaggccaac ggatgttctg gttctgctag agagctggtt cttcggatcc tggtaccagt 33060 

gcactgagag gaggcccagc ttgattctgg ggctgccttg tggtggcatg tgctgctcac 33120 

tgacaccctc gaggagtgtc ttctctcggg cttgttgact gtgcccggtt ttccgcagtt 33180 

cactggtgca cacataggca catagcaaac cgcacacaca gtcgtgggta tgagtttcac 33240 

tacattccac caccagtgtt cactaccatt acctgccttc cgtcttaagt gttcatcatt 33300 

taaaaataaa tttattgggc tggacgcggt ggctcatgac tgttatccca gcactttggg 33360 

aggctgaggc gggcagatca cctgaggtca ggagttcaag accagcctgg ccaatatggt 33420 

gaaactccat ctctactaaa aatacaaaat tagctgggca tggtggggca tgcctataat 33480 

cccagctact caggaggctg aggcaggaga atggcgtgaa cccgagaggc agagcttaca 33540 

gtgagcccag atagcaccac tgcagtccag cgtgggcaac agtgcgagac tccatctcaa 33600 

aaaaaaaata aataaataaa agaaaaataa atttatgatc tatttcaaaa ataacacatg 33660 

tactttgaaa cagcagagac acatatgaca cggagaatga aattccccat agcgcacccc 33720

caagagacag ccctggtccc cccgtctttc ccgtggacct ccagcggggc agatgctgag 33780 

ccgcctgttg tcgagtggcg tgctatcccg tcctccagct cctctgtggc ttacagacac 33840 

ccacctgcag ccctgtcttt gcctcctcta gcgcccacca ccttcttgct gttcagccag 33900 

aaatctgcca tcagtcggat gatcccggac gaccagcaca gcccggatct catcctgccc 33960 
ctgcatggac tgaggaacgt caaagccatc gactatgacc cactggacaa gttcatctac 34020
tgggtggatg ggcgccagaa catcaagcga gccaaggacg acgggaccca ggcaggtgcc 34080
ctgtgggaag ggtgcggggt gtgcttccca aggcgctcct cttgctggtt tccaggctgc 34140
tgcccctgtc cttagcagag ggaggaaaca gaggatggct ctgggtgaat gatgacttgg 34200

gcttcgatta tgtagtcaca gggtatgacc ctgagatgcg tggaacccog agactgtgat 34260 

tatatgtaga aactgggttt ccccgttgtt taagtagtca tggtggggtc agaccccaca 34320 
ggacttttgt cttttcaaga aagaaaatgg tcgtgtgtca tgcaggggta gttggtactg 34380
gttaatccag gtttatcctt tattttgtgg gaactgtaca gtcatttctg ctacaatgct 34440
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gtatatgctc ttctgaaaga cacctatgca aaatcgcaca gtaaaaatga cacaactcat 34500
agggaaagcg gggccagggc acagccctca aaatctccat caatgacatg taagaaaaga 34560
gaggaacctg ggaaatagca aagtgccttt tgcacattaa atggttagct atatcccaca 34620
atactgtgca ttcgtaaacg ttaatgctgc aataaatacg gcacttcacc ttgggaagat 34680
cttatgagtg tggaagggtg tagcgcatga gtttttgtga aacactggaa 34740
aaaaaattcrt gggaaatcaa atggaaagtt ctcaccccag gcgtggagaa gagtgggtca 34800
y y \^ w v»» agtgagccca gggaggtcag agacggaggt gtgtgtgtgg gtgtgaccct 34860
gcgcagttcc ctgccggctg tagttttttg cattcgctta atgtttctcg tggaggaaat 34920
tgtgcatgag caaatgtgaa accgtgctgt gctcaaattg tcctaataca tcattgcatt 34980
ggaacagatt ggcttttttt tttttttttt tttttttttt tttttgagat ggagtctcac 35040
tctgtcacca gcctggagtg cagtggcatg atcttggctc actgcaacct ttgcctccta 35100
tgttcaagtg attttcctgc ctcagcctcc tgagtaactg ggattacagg catgagccac 35160
cgcggccggc cagatttgca tttttgaaac aactgctagg ctgggcgcgg tggctcacac 35220
ctgtaatccc agcactgtgg gaggccgagg caggtggatc acctgaggtc aggggttcga 35280
gaccagcctg gccaacatgg tgaaaccccg tctctactga atatacaaaa atcagctggg 35340
tgtggtggcg ggtgcctgta atcccagcta ctcaggaggc tgaggcagga gaattgcttg 35400
aacccaggag gcagaggttg cggtgagccg agatcacacc attgcactcc agcctgggca 35460
acaagagcaa aactccatct caaaaaataa aaaatagaaa aacaagtgct gtagcggaag 35520
tgagcacttt gcggagtcag gcttgtgtgg cctgttccac aaatgatgtg ctcacggtgg 35580
cctcaggccc acctggagtc tgcagcatgg ggcacaacag gttcattagt gtagaattcc 35640
aggacaggcc tggctcctaa gcagccttct tttacaaaaa ctgcagagcc cgcctgtatc 35700
ctagcacttt 999 a 99 cc 9 a aSf t 999 t 99 a tcacgaggtc aggagttcaa gaccagcctg 35760
gccaacatgg tgaaacccca tctctactaa atatacgaaa attagctggg tgtggtggca 35820
cgcgcctgta gtcccagcta ctcgggaggc tgaggcagaa ttgcttgaac ctgggaggtg 35880
gaggttgcag ggatctgaga ccatgtcatt gcactccagc ctgggcaaca gagcgagacg 35940
ccatctcaaa aaaaaaaaac ctacagagcc acacggcctc tttctccacc gagtgttggt 36000
gtgggagctt gtgttattgt ggtgaaatct tggtactttc ttgaggcaga gagaggctga 36060
gcgcctggag agactttcac atgggtcgcc atgtccgccg tcggtttcgc tgttgtgctc 36120
cccatctgaa ggctggtgcc gtccagacag gctggacgcc cctttccacc agatccttcc 36180
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tcccgcagca gtttctagtt acgttgtact gtgaggtctg tgtccttggt tgatggcaaa 36240
agtcagccga attgaaattc agagccatgc ctggctccct ggagcttctc tcctgggcag 36300
ctgtgatcat tgcctctgct gtggtgtggg tggtggaaat ggattccttt catcttgctt 36360
gctacaggtg actgtcacgt ggagtccttt ggagagaggg acgtgttaat tgatggatgt 36420
ggctcccatg ctgagaaagc tcctgggcgt acattgcctt agagtttcat tggagctgcg 36480
ttcttttatg gtgtctgcta ggcagaagtg atgaagactt ggaagaaaac ccagaaggtt 36540
ttccacttaa tttggaaaat gtgcttttcc cctcctgtgt cttttgctaa ggtccagcct 36600
cctgcagcct ccccgctctg tggactctgg ctttgattct ttattaggag tccccctgct 36660
cccccaaaag atggtgtcta aattatcatc caattggccg aggttttgtt ttctattaat 36720
tgtttttatt ttttattgtg gtaaatttat ataacataaa atttgccatt ttaattgttt 36780 
tgttattgtt gtttttgaga cagggtctca ccccagtgcc caggctggag tgcagtggtg 36840
cgatcatggc tcactgcagc ctcagcctcc agggctccag tgatcctctc acctcagcct 36900
ctctagtagc cgggactaca ggcatacact accacatctg gctgattttt tgtatttttt 36960
ttttattgta gagacccgct atgttgccca ggctggtctc aactcctgga ctcaagccat 37020
cctcccacct caccctccca aagtgctggg attacaggca tgagccacaa cacccagcca 37080
ttttaatttt tttttttttt tttgagatgg agtctcactc tatcgcccag gctggagtgc 37140
agtggcgtgg tatcaactca ctgcaacctc tgcctcccag gttcaagcga ctctcctgcc 37200
tcagcctcct cccgagtagc tgggattaca ggtgcccatc actatgcctg gctaattttt 37260
gtatttttta gcagagacgg ggtttcacca tgttggccag gctggtcttg aactcctaac 37320
ctggtgatcc gcccgcctcg gcctcccaaa atgctgagat tacaggtgtg agccaccgtg 37380
cccggccttt ttttgttttt gagacagggt cttgccctgt cacccagact ggagtgcaat 37440
ggtgggctct tggctcactg cagcctccgc ctcccaggct caagttgtgc acctccacac 37500
ctggctaact gtattttatg tagagacaga tttcaccatg ttgcccaggc tgggcttgaa 37560
atggactcaa gcagtccacc cacctcagcc tcccaaagtg ctgagattac aggcgcgagc 37620
caccgcaccc agcccatttt acctattctg cagttgacag ttcagtggca ttcagtcagt 37680
tcacgaggta accatcactg ccattcatct ccagactact tcaccttctc ggcagatgtc 37740
cgaaactgtc cgcattgaac acactcctca tctccctctg acagccacca ttctactttg 37800
tatctctctc tgccttctct aggtacctca tgtaagtgga attataccaa tatttgccct 37860
tgtgtgactg gcttctttca tgtgacatgg tgtcctcaag gttcatctgt gttatagcct 37920
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gtgtcagaat ttccttcctt aaagcctgaa taataacccg ttgtaaaggc tgggcgcggt 
ggctcacacc ctctaatccc agcattttgg gagtccgagg tgggcagatc acttgaggtc 
aggagtttga gaccagcctg gccaacatag tgaaaccctg gctctactaa aagtacaaaa 
ttagctgggt gtggtggcgc gcacctgtaa tcccagttac tcaggaggct gaggcaggag 
aatcgcttgt acccgggagg cagaggttgc agtgaaccaa gattgtgcct ctgcagtcca 
gcctgggtaa cagagtgaga cttcctgtct caaaaaaaaa aaaaatcatc ggatggatgg 
acggaccact tcttgttatt tatccatcca cgggtgctag gtttcttcca cctttggttg 
tcgtgaataa ggccactatg aacatttcct tccgtggtga aggttttgta ctagtgagga 
aaaggcgtgt ttgtggtgtt gcataggatt ctggtaagaa agtttgcact aaccataagt 
atttgtacta cattaaaatg aaagctcagg ggccgggcgc ggtggctcac gcctgtaatc 
ccagcacttt gggaggccag ggcgggcgga tcatgaggtc aggagatcaa gaccatcctg 
gccaacatgg tgaaaccccg tctctactaa aaataccaaa aaactagcca ggtgtggtgg 
cgggcacctg tagtcccagc tacttgggag gctgaggcag gagaatggcg tgaacccggg 
aggcggagct tgcggtgagc cgagatcgct tcactgcact cgagcctggg caacagagca 
agactccgtc tcacgcaaaa ctctgtctca cgcaagactc cgtctcaaaa aaaaaaagag 
ttcagggttt atgaaactgg ccagccgcgt aaagtttgct gtgttgtttt tgtgcccggg 
aggagtgtgg ccagggtgtc acgtcacaca gtacacgttt ctcagatggt ggttctccag 
actgctgtcc caaagtctgt ttttgcatct ggttcccaca gacccaccct ccacggtgag 
cctgattttg gccagggtag ctggaatctt gcttgtcttt cagcccggca gctgtaccag 
tccagggtcc acagctagtg gcttttagga aggaatttgt tcagttggct ttgacacatg 
gccccctagg gtccacagct ctgtagtgat gtggatgttg ttatctacaa agacacatga 
tccttcgtgt ccagatgaaa gtgatgatgt ctttgcagct gcccagcaag gctgtgtgtg 
tgtgtgtgtg tgtgtgtgtg tgtgtgtgtg tggtgtgtgt gtggtgtgtg tgtgtgtatg 
ggggagggag gcaccctttc catctggggg tgtgtgtgtg tggggtgtgt gtgtgtgtgt 
gcgcgtgtgt gtggtgtgtg gtgtgtgtgt gtgtatgggg gaggcaccct ttccatctgg 
gtccaagaga ctgggcctgg ggaagacgct tctttttatc tacttagaga ctttgtttta 
tttgtatttt tttgagacag ggtctcactc tgtcacccag gctggggtat ggtgatatga 
gcatagctca ctgcagcctc ggcctcccag gctgaagcga tcctcccacc tcagccttct 
gaatagctgg gactgtaggc gtgcgtcacc atactgagct attgtttttt ttgtttggtt 
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38820 
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39060 
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ttgagaccag cctgggcaac atggtgagaa cccatcccta caaaaaatac aaacaaaaag 44940

ggaccaggct gggcttggtg gctcatgcct ataatcccag cactttggga gaccaaggtg 45000 

ggctgatcac ttgaggttgg gagttcgaga ccagcctgcc caacatagtg aaatcccgtc 45060 

tctactaaaa atacaaaaat tagctgggtg tggtggcagg cgcctgtatt cccagctact 45120

ggggaggctg aggtgggaga attacttgaa cctgggaggc ggaagttgca gtgagccaaa 45180 

attacgccac tgcactccag cctaggcaat agagtgagac tccgtctcaa aaaaaaaaaa 45240 

gggccagggg tggtagtgac aaagagaccc tatcccaaaa aaaccgaaca ctgaatcctt 45300 

gagactgagt aaggacactg tgaaattttt ctgggtgggg cagggaacag agcgtcttct 45360 

gtcatttctt ccacctgggt gtggtcagct ctccctccaa gctgcctcct cttcttctca 45420 

ttgtccgggt gttggacaca tttggttaac tggatagaat aacgcgagtt cccagggact 45480 

tggtccattt gctattttat tttattttat tttattttat tttatttatt tatttattta 45540 

tttatttatt tattgagatg gagtttcgtt tttgtcgccc aggctggagt gcagtggcgc 45600 

gatctcggtt cactgcaacc tctgcctccc aggttcaagt gattctccta cctcagcctt 45660 

ccaagtaact gggattacag gcacccacca ccataccagg ctaatttttt tgtattttta 45720 

gtagagacgg gttttcgcca ttttgcccag gctggtcttc aactcctagc ctcaggtgat 45780 

ccacgcacct cggcctccca aagtgctggg attacaggca tgagccacca cgcctggcac 45840 

catttgctat tttaattccc atgtgtatta gtgtcccacg gctgctgtaa caaatgacca 45900 

caaactggat ggcttaaagc aacagaaatg gattccccca atgtgctgga gaccagaagc 45960 

ctgcgaccaa actgttggga gggctgtgct tcctctgggg gctccaggga ggatctattt 46020

gttggccctt ccagtgctgt gggtgccagc gttccacact tgtggatgcg ccgcctcaac 46080 

ctctgcccat cttcatgtgt ccatctcctt tgtgtctgcg tctttacctc ttcttcttgt 46140 

ctgtgttgcc tcttataagg acgtttgtca ttgggtttag ggcccaccca aatcatccga 46200 

gatgacctcg tcttgagatc cttaacctgc aaagaccctt tttccaaaaa aaggttatgc 46260 

tcacagattc taggccttaa gacatgggtg tatctttctg gggggcacta tccaacccct 46320

tatacaatga aagacgggaa gagggccagg tgtggtagtt cacgcctgta atctcagcac 46380

tttaggaagc tgaagcggga ggatcacttg agcccaggag tttacaagta gctaggcaac 46440 

atgatgagac cccatttcta caaaaagtga aaaaaaaaaa aaaaaaaaaa aagccaggtg 46500

tggtggctca cacctgtaat cccagcactt tgggaggctg aggcaggcag atcacgaggt 46560 

caggagattg agaccatcct ggctaacacg gtgaaacccc gtctctacta aaaatacaaa 46620
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agtgcaatgg 


CytCat-Caty 


i— f /-1 4- ^ — j +— — * j— 1 -3 

yCLd.au uy_d 


y^c i>c L- ct v_/ i_- 


59220 


tctaggctca 


agcaatcctc 


ccatctcagc 


ccctaagt.ag 


CtayydO Lyt 


d.L<yuai--yLat> 


592 80 


ccccatgccc 


agctaatatt 


taCattLULL. 


y u ay cty a Ly a 


cty LLLUa - Uct 


L-CtL- Ly LL-Luy 


5 934 0 


gctggtc tec 


aac uccuyga 




****** ^ uyuv* u. 




a cr a f~ cjp t crcrcr 
<=Lyy <-y^ L yyy 


59400 


attacaggcg 


tgagccaccg 


4- /-v- ^<<-i+- t~rf~i s~* f-> +■* 

tycctyyt-LL 


yyyy u ct l. i_y u 




lcilu ty ul. y 


59460 


tggtgggccc 


tgggaaggaa 


gtagcagaag 


agggttcttc 


urggtucccu 


ggacaguaac 


CQC90 


tgagtgttct 


ggaggcccca 


gggcctggct 


ttgtttaggg 


acaaagggaa 


ctggtaacca 


COCOA 


gaagecgaga 


gtttaaacac 


ccactgccct 


tcttccctgc 


tcctgctgct 


gcaacccagc 


bi?o4 u 


ttaaccagcc 


aggagtgeta 


ggaacccaag 


cagggccccc 


gagcacacag 


c ayycayc u c 


c q 7 n n 


acgaattctc 


4-4- 4-4- ™ >~ 4- ,— - 4- 4- 

ttttcctgtt 


c ucccu uggg 


ay c tyyyayy 


oUCLtaaL-La 


y y L-ctct l. dciy ci 




gatggcactg 


agcagccagc 


taattUUtt-a 




4-4-^4-4-4--3-i/-i<i-i 

ttyLU LaauL 


ahat-nacira 


5982 0 


cccacttaaa 


aaagggtaca 


gttcagtggg 


4- f. 4~ 4- . A f.>.f -j 

cT-Euagugt-a 


uucacagaug 


uy uyOaaOCL 




tcaccacagt 


taattttaga 


acattttcct 


gcccctaaaa 


gaaactctgc 


aegaayecay 




ctgtttttaa 


attagcaaag 


ttattttgea 


4- .— a .« -4- -4- 4- An — 4— 

tcctttaaat 


atatgttcat 


ggtacaaaat 


/rr\ n fi n 


tcaaaagata 


cagaagagtc 


tgcagtccaa 


agagaccccg 


cccccacgac 


y ccaay vay y 




catccctggg aggcatggcc tcctgcagtg tgtttcttct atgtcccccc aggggtcatc 60120
tgtacatatg caagcataca agagcgtgga ctttgttttc caagccagaa gataattgta 60180
gatttatgtg cagttgtgag aaagagcaca gacccattta tcctctgcct ggtttccccc 60240
agtgctgcct gccatcttgc atgacttcca ttcctatcat aagcaagaca ctgataacga 60300
ttctttcacc ttattcagat tgacataagt gttttttgtt tgttcttgag acaaacttcc 60360
tctgtcaccc agtgggagtg cagtggcaca atcacagctc actgcagcct caaactcctg 60420
ggctcaagcg attctcctgc ctcagtcccc tcaagtagct cagatggcag gtgtgcacca 60480
tcatgccagg ctaattttta aattttttgt ggaggtgagg cctcactaaa tttcctgggc 60540
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l-dy l. uy dd 


p t~ p p t~ ctp. cr p 1~ 


aaaataatcc 


tcctgcctca 


gcctcccaaa gtggtaggat 


60600 


fapa acre* a t~ ci 
LctL-ctyy^a. Ly. 


ciy u^a^ uy\*y 


cctacrcrctcra 


catatgtgtt 


ttegtaagee 


cgaaagatag 


60660 


pAtPtcraacra 


at paacattcr 


agccttgcct 


tttgetgeta 


acgatgtata 


aaagctgctg 


60720 




t ut-yyciyyt- l 


nr , r t ?iCICf'CSCC , 


gtgtgcaccc 


tgectagage 


tetacegtaa 


60780 


y-w **\ +— /-i /-( 

CCCatCLCCy 


99 a 99 a 99 T: 9 


rl-^ttcrfttt* 
iML.aLuytL.tL 


pptpattttcr 


caacaaggag gctgaagaac 


60840 


tgagcatgaa 


^1 ^1 y^r /T*f r"** /"t ^ 

ccac uggccc 


gggtuy LUty 


nt* t nnt - p era p 
y u uyy uctyy 


agtggggcca ggccatccaa 


60900 


ctca.ca.acca. 


+— -f— 4— "3 4— /-« 


4- rrr 1 1~ +■ rrrrr 1 




gtttgttctg 


ttttgaggac 


60960 


a a /r /™t rr 4** /** 2 


t*ciL. L-w u uyy u 


rr^f r* t - cr s r3 p a a 

u w t* *n 


cacfccctfccft 


caggcgtggc 


tgggccccca 


61020 




LL.auyy t <w <^ l, 


ptptcctcrct 


gcggttgaac 


cttggctgtt 


tcaaccactc 


61080 


/-» +- C~r /~i ~S a 4* 

CuycUaayCy 


ycLut^ u.y dd 


ci a ci r raaf p 


catcttttct 


cagcagaggg 


ccacactggc 


61140 


aaaacggt cc 


ctgg cdcccL 


L- LtLtLvv^at 




tagagtaaaa 


atggtatcat 


61200 


gttaagatct 


4___i_i— 4— — +- — 4— 

tcatttatat 


4- 4- 2 4- +- 4- +* a 4- r* 


at*ria a 4- «a 4- <t 

augaatyciuy 


taagcatcat 


tttgtgtgtt 


61260 


taagaacctt 


tgggcccagc 


ycyauyycut 


yuay L uy t-ctct. 


tctcagcact 


ttaggaggct 


61320 


gagaugay eg 


y^r^ 4— ^ /■» 4" 4" /T 3 

gatCaCLLyd 


gycuyyyciy u 


u. l, y ciy ctuu cty 


cctggccaac 


atggagaaac 


61380 


j~* /-Y 4™ +• 4" ^ 

CCCytCUCua 


y Laadaa LIL 


cLcictctcic tciyt 




gatcccagct 


acttgggagt 


61440 


c tgaagca tg 


agaau ugc tu 


y detect uy y y ct 


gg c gy«-gy 


geagtgagee 


gagatcgege 


61500 


cattgcactc 


cagcctgggc 


gaeagagega 


gac ucLycct 


caaaaaaaaa 


aaaaaaaaag 


61560 


aaaagaaaag 


aaattatcaa 


tCtCCLCtUt 


4-a4- r^f~t pahal" 
UclLyyL.ctLaL, 


atatatatat 


atatatatat 


61620 


at at at a tat 


_ 1— _4_— 4_ — 4-4-4— 


LUtttCLLUy 


y L, L. ct. L,y L. L-(— cl 


gaaaggcett 


ccctgctctg 


61680 


-^i 4— 4— — * — * — i 

duCataaaaa 




L. LL/CttCtt LL U- 


rlTt'ettttt 


tttttgagac 


agagttttgc 


61740 


^ n ^* ^ /~r ^ ^ /^f ^ /"* 

tCCtyctyCC 


caggccggay 


ty cay t-yy <-y 


L.CICIL.L. L> ^ CLy \^ 


tcactgtaac 


ctccgcctcc 


61800 


c 999 tt: 99 a 9 


4-* r-x — > 4— /— » 4— >i 

LyatLCCCCU 


(^i 4— 4— a nrif f 
yC-CL-U-CLl^^L-L. 


UvrfLy ay tayi- 


tgggattata 


ggcatgcacc 


61860 


accatgcctg 


gctaattttg 


caccctuagc 


agagaegggg 


gtttctccat 


gttggtcagg 


61920 


ctggtctcga 


actcgcgacc 


tcaggtgatc 


cacccacccc 


ggcctcccaa 


agtgctggga 


O X .7 O <J 


ttacagacgt 


gage cac cat 


y-Y /-« /~t /~i a <T P ja 

yttL- cty o o l> ct 


/~ta/~ , +~^ , i~i»-»4--4-4- 


cttaacgtcc 


tcctcctttc 


62040 


gttttacgtt cacatcttta attcttctgg gatgtaatta gatttgatga gcaaggtggg 62100
catccagctt gtttcttggc tgatggctta tgggtggcgt gaattagtcg gggtctatca 62160
ggaggcagaa actctatgag aatttgaaca gagaaagttc cgtctacagg cttattacca 62220
gggactggaa tagcagaaat tgaacagtga gatgtacaga gaactctaag aatgcaggaa 62280
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taggccaggc atggtggctc acacctgtca tcccagcact ttgggagacc aacrqcQcrcrtq 62340
gatcacctga ggtcaggagt tcgagaccag cctggccaac atagtgaaac cccatctcta 62400
ctaaaaatac aaaaaaatta gctgggtgtg gtggcgcatg cctgtaatcc cagctactcg 62460
ggaggctgag gcaggagaat cacttgaacc tgggaggcag aggttgcagt gagccgagat 62520
catgccactg tactccagcc tgggtggaag agcggaactc tgtctgaaaa aaaaaaaaaa 62580
aacaagaagt tcaacttgaa gggaaaaatg ccgtattgtc tttccctttg ttatgtcacc 62640
agggcacagt ccatcccagg ctggcgctga tccacgggct ggagaggggc tgccccagaa 62700
gaggacatgc caggaagggc ttggctggtg ttcaggagcc caggccaggt caggtcaaga 62760
ggtgttgagg ctggacggga gaggccagct aggggctcat gtaggatatg aggggtcggc 62820
ccatttcaac gtggaaactg agctcttctg cttctctttc ttcttcactg cattaagatt 62880
caataccgct tgggaagcag gtatttccct tcctataaag gatggttggg agcctgagtg 62940
ttgggagaaa gtgtagccgc tgagttacta acaactaggg ctgccgtcaa gcctatgggg 63000
aaagagagaa gaggacattt ggaaggagag agatcaagct gtggcaccct gggagaggac 63060
cacagaaaag aggccagtga gggggttccc cggtggcatc tgaaggtgtg gcccaaccag 63120
gaggtccaga ggctgccagc cgagtggccc aggagaggga acctcacagg ggctgagtgg 63180
gacccaagcc ctatccaccg tcctaaccac ccacatttct cgggaacaag acctcccaca 63240
gtggcctccc cggcagtgga aatagccaaa ctggcaacat ggactttctt caactgcccg 63300
ggcgatgctg cctcagtgcc ccagggcagg caggaagctc ccacacccat tctggaatga 63360
ggggttggag gaaggctgag ctgagcaaag gacccatctc tgctctggtt ggtggggagg 63420
gagcccatta tacaagagac ccctcagggc tcagtgaggg gtgacagaga cttggggagt 63480
agtggctgtc actgcagagg tgagagggtt tggagagaag gtacatgcct ttttggccac 63540
attgagtagc acctggtagc cagttagtaa cgtgtattgg ataaacaaaa gattaaacgg 63600
atgcaaaaaa aaatgttggc tttgcttctt tttacccaaa cctcagttcc ctcaagtaga 63660
ttctgggaac accccctacc tggctggact gttgtgaagt ttaaataagc caggttaact 63720
tcacctcctc ctttaagaca cagctcagac actgcctcct ccaagaagcc ccctctggct 63780
tcctgtgtga atatgacggc cctctgggct ctagggtatc ttagaacaat gcttccttat 63840
ggctttggaa ccccgctgtc tcctggattg ggagcaaatg caggggagga gccacacctg 63900
actaatctct gggtctccca gcacataagt ggcataaggg cagggctgtg cccgcttcag 63960
gcacttactg aaggatgtac ttggcagagg gtaggcagcc ggcggatgag cccctcactc 64020
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tccccagctg actgcgtggg cgggaaaggc gggttcagga gacccagcct 


ccctgggctg 


64080 


tcaccacctc 


tgcacatcca 


gccccattga 


tcaagggttc 


aatttttggg 


gtcctgttgg 


64140 


gaggccagga gactctctcc 


aggcacttct tccaggtctt tgtgttaggg tgtgtgtgtg 


64200 


tgtgtgtgtg tgtgtgtgtg tgtgttgttt gttttatttt 


atttatttat 


ttatttattt 


64260 


atttatttat 


ttatttattt 


tqaqacgcag 


tctcgctctg 


ttgcccaggt 


tggagggtgg 


64320 


tggcatgatc 


tcggctcact 


gcaagctccg 


cctcccgggt 


tcacgccatt 


ctcctgcctc 


64380 


actcttcctg 


agtagccgga 


ttacaggcgc 


acgcaccatg 


cctggctaat 


tattttgttt 


64440 


ttttagtaga gacagggttt 


cgccacgttg 


cccaggctgg 


tcttgaatcc 


ctggcctcaa 


64500 


gcgatccgcc 


cgcctcagcc 


tcccaaagtg 


ctgggattac 


aggcgtgagc 


caccgtgccc 


64560 


gcccagccta 


ggggtacatg 


aaactttttt 


tttttttttt 


ttgagacaga gtttcactct 


64620 


gtcctcaggc tggagtgcag tggcgtgatc tcggcgtact gcaatctccg cctcccggtt 64680
caagcgattc tcctgcctca gcctcccgag tagctgggat tgcaggcacg cgccaccaca 64740
cccagctaat ttttgtattt ttagtagaga cgggctttca ccatgtggga caggatggtc 64800
tcgatctcct gacctcgtga tccgcccgcc tcagcctccg aaagtgctgg gattacaggc 64860
ctgagccacc gtgcccagcc atgatgtttt gatacaggca tataacgtat aataatcaca 64920
tcagggtaaa tgatgtaacc atcacatcaa gcatttatcc tttgtgttac aaaaaaaaat 64980
ctaattatac tttcctactt attctttttt tttttttttt ttgagacgga gtctccctca 65040
gtcgcccagg ctggagtgca gtggcatgat ctcagttcac tgcaagctct gcctcctagc 65100


tctgcctcct gggttcatgc cattctcctg tctcagcctc gcgagtagct gggactacag 65160
gcgcctgcca ccgtgcccgg ctaatttttt tttttgtatt tttggtagag acagggtttc 65220
accgtgttag ccaggatggt ctcgatctcc tgacctcata atccgcccgt ctcggcctcc 65280
caaagtgctg ggattacagg catgagccac cgcccccagc ctatttattc ttaaatgtac 65340
aataaattat tgttgactcc agtcaccctg ctgtgctacc aaatacggat cttcttcatt 65400
ctatctaact gtatttctgt acctgttaac catctctcct ccacctcacc ccccaaaccc 65460
actacccttc tcagcctctg gtaaccatcc ttctactctc tatctctatg agttcaattg 65520
tattaatttt tagctccccg gccgggcacg gtggctcacg cctgtaatcc cagcacttca 65580


ggaggctgag gcaggtggat cacgaggtca ggagtttgag accagcctgg ccaacatggt 65640
ggaaccccat ctctactaaa aacacaaaaa ttagctgggc gtggtggtgg gcgcttgtag 65700
tcccagctac ttgggaggct gaggcaggag aatcgcttga aactgggagg cagaggttgc 65760
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agtgagccaa gattgcgcca 


rfcrcactcca 

\hH I— C~L \—* W \m+ V t**- . 


atctocrcrtcra 


cagagtaaga 


ttccatcccg 


65820 


aaaaaaaaaa agtttagctc ccacaaataa gtgagaacac gtgaagtttc tctttctgtg 65880
cctcgcttgt ttcacttaac ataataacct ccagttccat ccacgttgtt gctttgttat 65940


aaatgacagg 


atcttggtca 


cr cr r* cr r* a cr t cr cr 
yy ^y v»y | -yy 


ctcatgcctg 


taatcccagc 


actttgggag 


66000 


gctgaggtgg actgatcatg 


yy ^-^^c^y^-y 


atcgagacca 


tcctggctaa 


cacagtgaaa 


66060 


ccccgtctct 


actaaaaata 




gccgggcgtg gtggtgggca 


cccatttccg 


66120 


ccccttctcg ggacgctgat 


crfp raa cat* a 


ttacccatcc 


ccggaagact 


aatcctcccc 


66180 


cactctatat 


tgtacctctt 


r*r , i _ t"t"r*'t"r"r*l~ 


ccacgcgatt 


ccccgagtaa 


cccgtcttcc 


66240 


ctccctcctc 


ggattacgct 




cttcaatcac 


gttgctccgt 


cccic\it.cccc 


66300 


attcgtacca 


ctcctcactt 


4- i-trTt" <■*• t" 1™ C* f™ 1 1" 


acccccacta 


tcccttttcg 


t"r*r*i" ct~ ctat 


66360 


tccttactta 


ctcctccccc 


4- 4- ~ •+-— 4- 4- <=> 4- 

ttctctucai 


acttcattcc 


ctccgctctt 






ctcccacttt 


cacctagttg 


i />~t /—i -4— r-t — ] i +- a 


cgttgccatc 


tcgccccttc 




66480 


ggcctctcac 


ccatctgtcc 


LL/ L LCIU 


ctctctcctc 


atctcgctca 


gacatctctc 


66540 


tagactatcc 


ctcactttac 


LUULLt cty L. u 


gtcttcttcc 


tatccttcgt 


fctccataat 


66600 


cttcacgtcg 


ccatctcttt 


4— orr/^ y"i ft /-» "H t~ +*" 

ucgcccctuu 


catatgtctc 


tcttcatgtt 


php3Pt"afpa 

L-L^-ClL.L.ClLvQ. 


66660 

V V v %/ 


ttctcatgat 


cactatcgtt 


ULa^ LatLL 


atcactcccc 


tctttcttca 


fpaatfcctc 

lr *i_> C4. U. <w ^ l» 


66720 


tccgtcattc tcgtctctct cttacaaccg ccttccttgt gctatctaac tcaaccatgc 66780
ctctcctact ctctctctat cgcccctcca tcgcttatgc atcctcttct attgcacacc 66840
cgcccctcca tcgcttatgc atcctcttct attgcacacc gcccctccat cgcttatgca 66900
tcctcttcta ttgcacatcc tcttctattg cac
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<210> 12 
<211> 21 
<212> DNA 

<213> Artificial Sequence
<220> 

<223> Artificial sequence is a primer. 

<400> 12 

ctgagcggaa ttcgtgagac c 



<210> 13 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer.
<400> 13 

ttggtctcac gtattccgct cga 

<210> 14 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer.
<400> 14 

ctcgagaatt ctggatcctc 

<210> 15 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer.
<400> 15 

ttgaggatcc agaattctcg ag 

<210> 16 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Artificial sequence is a primer. 
<400> 16 

tgtatgcgaa ttcgctgcgc g 21 



<210> 17 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 17 

ttcgcgcagc gaattcgcat aca 23 

<210> 18 
<211> 21 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 18 

gtccactgaa ttctcagtga g 21 

<210> 19
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 19 

ttgtcactga gaattcagtg gac 23 

<210> 20 
<211> 21 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 20 

gaatccgaat tcctggtcag c 21 

<210> 21
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 21 

ttgctgacca ggaattcgga ttc 23

<210> 22 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 22 

cuacuacuac uactgagcgg aattcgtgag acc 33

<210> 23 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 

<400> 23 



cuacuacuac uactcgagaa ttctggatcc tc 

<210> 24 
<211> 33 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Artificial sequence is a primer.
<400> 24 

cuacuacuac uatgtatgcg aattcgctgc gcg 

<210> 25 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer.
<400> 25 

cuacuacuac uagtccactg aattctcagt gag 

<210> 26 
<211> 33 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer.
<400> 26

cuacuacuac uagaatccga attcctggtc agc

<210> 27 
<211> 45 
<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Artificial sequence is a primer. 
<400> 27 

aactggaaga attcgcggcc gcaggaattt tttttttttt ttttt 45 

<210> 28 
<211> 13 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 28 

aattcggcac gag 13 

<210> 29 
<211> 9 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer.
<400> 29

ctcgtgccg 9

<210> 30 
<211> 14 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 30 

gtacgacggc cagt 14 
<210> 31 






<211> 16 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 31 

aacagctatg accatg 16 

<210> 32 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 

<400> 32 

ccaagttctg agaagtcc 18 

<210> 33 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 33

aatacctgaa accatacctg 20

<210> 34
<211> 57
<212> DNA

<213> Artificial Sequence
<220>

<223> Artificial sequence is a primer.
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<400> 34 

agctgctcgt agctgtctct ccctggatca cgggtacatg tactggacag actgggt 

<210> 35 
<211> 56 
<212> DNA 

<213> Artificial Sequence 
<220>

<223> Artificial sequence is a primer. 
<400> 35 

tgagacgccc ggattgagcg ggcagggata gcttattccc tgtgccgcat tacggc 

<210> 36 
<211> 27 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer.
<400> 36

agctgctcgt agctgtctct ccctgga

<210> 37 

<211> 27 

<212> DNA

<213> Artificial Sequence
<220>

<223> Artificial sequence is a primer.

<400> 37 

gccgtaatgc ggcacaggga ataagct 

<210> 38 
<211> 20 
<212> DNA 
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<213> Artificial Sequence 

<220> 

<223> Artificial sequence is a primer. 

<400> 38 

gagaggctat atccctgggc 

<210> 39 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Artificial sequence is a primer. 
<400> 39 

acagcacgtg tttaaagggg 



<210> 40 

<211> 163 

<212> DNA 

<213> Homo sapiens 

<400> 40 

actaaagcgc cgccgccgcg ccatggagcc cgagtgagct cggcgcgggc ccgtccggcc 60
gccggacaac atggaggcag ctccgcccgg gccgccgtgg ccgctgctgc tgctgctgct 120
gctgctgctg gcgctgtgcg gctgcccggc ccccgccgcg gcc 163

<210> 41

<211> 419

<212> DNA

<213> Homo sapiens

<400> 41

gccccacagc ctcgccgctc ctgctatttg ccaaccgccg ggacgtacgg ctggtggacg 60
ccggcggagt caagctggag tccaccatcg tggtcagcgg cctggaggat gcggccgcag 120
tggacttcca gttttccaag ggagccgtgt actggacaga cgtgagcgag gaggccatca 180
agcagaccta cctgaaccag acgggggccg ccgtgcagaa cgtggtcatc tccggcctgg 240
tctctcccga cggcctcgcc tgcgactggg tgggcaagaa gctgtactgg acggactcag 300
agaccaaccg catcgaggtg gccaacctca atggcacatc ccggaaggtg ctcttctggc 360
aggaccttga ccagccgagg gccatcgcct tggaccccgc tcacgggtaa accctgctg 419



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<210> 42 
<211> 221 
<212> DNA 

<213> Homo sapiens 



<400> 42 

ccccgtcaca ggtacatgta ctggacagac tggggtgaga cgccccggat tgagcgggca 60
gggatggatg gcagcacccg gaagatcatt gtggactcgg acatttactg gcccaatgga 120
ctgaccatcg acctggagga gcagaagctc tactgggctg acgccaagct cagcttcatc 180
caccgtgcca acctggacgg ctcgttccgg taggtaccca c 221



<210> 43 
<211> 221 
<212> DNA 



<213> Homo sapiens 
<400> 43 

tccctgactg caggcagaag gtggtggagg gcagcctgac gcaccccttc gccctgacgc 60
tctccgggga cactctgtac tggacagact ggcagacccg ctccatccat gcctgcaaca 120
agcgcactgg ggggaagagg aaggagatcc tgagtgccct atactcaccc atggacatcc 180
aggtgctgag ccaggagcgg cagccttttt gtgagtgccg g 221



<210> 44 

<211> 156 

<212> DNA 

<213> Homo sapiens 

<400> 44 

tttctcagtc cacactcgct gtgaggagga caatggcggc tggtcccacc tgtgcctgct 60
gtccccaagc gagccttttt acacatgcgc ctgccccacg ggtgtgcaga tgcaggacaa 120
cggcaggacg tgtaaggcag gtgaggcggt gggacg 156



<210> 45 

<211> 416 

<212> DNA 

<213> Homo sapiens 

<400> 45 

ctccacagga gccgaggagg tgctgctgct ggcccggcgg acggacctac ggaggatctc 60
gctggacacg ccggacttca ccgacatcgt gctgcaggtg gacgacatcc ggcacgccat 120
tgccatcgac tacgacccgc tagagggcta tgtctactgg acagatgacg aggtgcgggc 180
catccgcagg gcgtacctgg acgggtctgg ggcgcagacg ctggtcaaca ccgagatcaa 240
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cgaccccgat ggcatcgcgg tcgactgggt ggcccgaaac ctctactgga ccgacacggg 300 
cacggaccgc atcgaggtga cgcgcctcaa cggcacctcc cgcaagatcc tggtgtcgga 360
ggacctggac gagccccgag ccatcgcact gcaccccgtg atggggtaag acgggc 416 



<210> 46 

<211> 198 

<212> DNA 

<213> Homo sapiens 

<400> 46 

ttcttctcca gcctcatgta ctggacagac tggggagaga accctaaaat cgagtgtgcc 60 
aacttggatg ggcaggagcg gcgtgtgctg gtcaatgcct ccctcgggtg gcccaacggc 120
ctggccctgg acctgcagga ggggaagctc tactggggag acgccaagac agacaagatc 180
gaggtgaggc tcctgtgg 198

<210> 47 

<211> 244 

<212> DNA 

<213> Homo sapiens 

<400> 47 

ccgtcctgca ggtgatcaat gttgatggga cgaagaggcg gaccctcctg gaggacaagc 60 
tcccgcacat tttcgggttc acgctgctgg gggacttcat ctactggact gactggcagc 120
gccgcagcat cgagcgggtg cacaaggtca aggccagccg ggacgtcatc attgaccagc 180 
tgcccgacct gatggggctc aaagctgtga atgtggccaa ggtcgtcggt gagtccgggg 240 
ggtc 244 



<210> 48 

<211> 313 

<212> DNA 

<213> Homo sapiens 

<400> 48 

gttcgcttcc aggaaccaac ccgtgtgcgg acaggaacgg ggggtgcagc cacctgtgct 60 
tctgcacacc ccacgcaacc cggtgtggct gccccatcgg cctggagctg ctgagtgaca 120 
tgaagacctg catcgtgcct gaggcctttt tggtcttcac cagcagagcc gccatccaca 180 
ggatctccct cgagaccaat aacaacgacg tggccatccc gctcacgggc gtcaaggagg 240
cctcagccct ggactttgat gtgtccaaca accacatcta ctggacagac gtcagcctga 300 
aggtagcgtg ggc 313 



<210> 49 
<211> 255 



<212> DNA 



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<213> Homo sapiens 
<400> 49 

cctgctgcca gaccatcagc cgcgccttca tgaacgggag ctcggtggag cacgtggtgg 60 

agtttggcct tgactacccc gagggcatgg ccgttgactg gatgggcaag aacctctact 120 

gggccgacac tgggaccaac agaatcgaag tggcgcggct ggacgggcag ttccggcaag 180 

tcctcgtgtg gagggacttg gacaacccga ggtcgctggc cctggatccc accaaggggt 240 

aagtgtttgc ctgtc 255 



<210> 50 

<211> 210

<212> DNA 

<213> Homo sapiens 

<400> 50 

gtgccttcca gctacatcta ctggaccgag tggggcggca agccgaggat cgtgcgggcc 60 
ttcatggacg ggaccaactg catgacgctg gtggacaagg tgggccgggc caacgacctc 120
accattgact acgctgacca gcgcctctac tggaccgacc tggacaccaa catgatcgag 180 
tcgtccaaca tgctgggtga gggccgggct 210 

<210> 51 

<211> 352 

<212> DNA 

<213> Homo sapiens 

<400> 51 

gtgttcatgc aggtcaggag cgggtcgtga ttgccgacga tctcccgcac ccgttcggtc 60 
tgacgcagta cagcgattat atctactgga cagactggaa tctgcacagc attgagcggg 120
ccgacaagac tagcggccgg aaccgcaccc tcatccaggg ccacctggac ttcgtgatgg 180 
acatcctggt gttccactcc tcccgccagg atggcctcaa tgactgtatg cacaacaacg 240 
ggcagtgtgg gcagctgtgc cttgccatcc ccggcggcca ccgctgcggc tgcgcctcac 300 
actacaccct ggaccccagc agccgcaact gcagccgtaa gtgcctcatg gt 352 

<210> 52 

<211> 225 

<212> DNA 

<213> Homo sapiens 

<400> 52 

gcctcctcta cgcccaccac cttcttgctg ttcagccaga aatctgccat cagtcggatg 60 
atcccggacg accagcacag cccggatctc atcctgcccc tgcatggact gaggaacgtc 120 
aaagccatcg actatgaccc actggacaag ttcatctact gggtggatgg gcgccagaac 180 
atcaagcgag ccaaggacga cgggacccag gcaggtgccc tgtgg 225



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<210> 53 

<211> 235 

<212> DNA 

<213> Homo sapiens 



<400> 53 

ctttgtctta cagccctttg ttttgacctc tctgagccaa ggccaaaacc cagacaggca 60
gccccacgac ctcagcatcg acatctacag ccggacactg ttctggacgt gcgaggccac 120
caataccatc aacgtccaca ggctgagcgg ggaagccatg ggggtggtgc tgcgtgggga 180
ccgcgacaag cccagggcca tcgtcgtcaa cgcggagcga gggtaggagg ccaac 235

<210> 54

<211> 218

<212> DNA

<213> Homo sapiens

<400> 54

ccaccctccc gcaggtacct gtacttcacc aacatgcagg accgggcagc caagatcgaa 60
cgcgcagccc tggacggcac cgagcgcgag gtcctcttca ccaccggcct catccgccct 120
gtggccctgg tggtggacaa cacactgggc aagctgttct gggtggacgc ggacctgaag 180
cgcattgaga gctgtgacct gtcaggtacg cgccccgg 218

<210> 55
<211> 234
<212> DNA
<213> Homo sapiens
<400> 55

ggctgcttgc aggggccaac cgcctgaccc tggaggacgc caacatcgtg cagcctctgg 60
gcctgaccat ccttggcaag catctctact ggatcgaccg ccagcagcag atgatcgagc 120
gtgtggagaa gaccaccggg gacaagcgga ctcgcatcca gggccgtgtc gcccacctca 180
ctggcatcca tgcagtggag gaagtcagcc tggaggagtt ctgtacgtgg gggc 234



<210> 56 

<211> 157 

<212> DNA 

<213> Homo sapiens 

<400> 56 

ttgtctttgc agcagcccac ccatgtgccc gtgacaatgg tggctgctcc cacatctgta 60
ttgccaaggg tgatgggaca ccacggtgct catgcccagt ccacctcgtg ctcctgcaga 120
acctgctgac ctgtggaggt aggtgtgacc taggtgc 157
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<210> 57 

<211> 272 

<212> DNA 

<213> Homo sapiens 

<400> 57 

gttctcctct gtccctcccc cagagccgcc cacctgctcc ccggaccagt ttgcatgtgc 60 
cacaggggag atcgactgta tccccggggc ctggcgctgt gacggctttc ccgagtgcga 120 
tgaccagagc gacgaggagg gctgccccgt gtgctccgcc gcccagttcc cctgcgcgcg 180 
gggtcagtgt gtggacctgc gcctgcgctg cgacggcgag gcagactgtc aggaccgctc 240 
agacgaggtg gactgtgacg gtgaggccct cc 272 



<210> 58 

<211> 134 

<212> DNA 

<213> Homo sapiens 

<400> 58

tctccttgca gccatctgcc tgcccaacca gttccggtgt gcgagcggcc agtgtgtcct 60
catcaaacag cagtgcgact ccttccccga ctgtatcgac ggctccgacg agctcatgtg 120
tggtgagcca gctt 134 

<210> 59 

<211> 274 

<212> DNA 

<213> Homo sapiens 



<400> 59

gtttgtctct ggcagaaatc accaagccgc cctcagacga cagcccggcc cacagcagtg 60
ccatcgggcc cgtcattggc atcatcctct ctctcttcgt catgggtggt gtctattttg 120
tgtgccagcg cgtggtgtgc cagcgctatg cgggggccaa cgggcccttc ccgcacgagt 180
atgtcagcgg gaccccgcac gtgcccctca atttcatagc cccgggcggt tcccagcatg 240
gccccttcac aggtaaggag cctgagatat ggaa 274



<210> 60 

<211> 164

<212> DNA 

<213> Homo sapiens 

<400> 60 



cttccctgcc aggcatcgca tgcggaaagt ccatgatgag ctccgtgagc ctgatggggg 60 
gccggggcgg ggtgcccctc tacgaccgga accacgtcac aggggcctcg tccagcagct 120
cgtccagcac gaaggccacg ctgtacccgc cggtgagggg cggg 164



<210> 61 

<211> 130 

<212> DNA 

<213> Homo sapiens

<400> 61 

ttggctctcc tcagatcctg aacccgccgc cctccccggc cacggacccc tccctgtaca 60
acatggacat gttctactct tcaaacattc cggccactgc gagaccgtac aggtaggaca 120
tcccctgcag 130



<210> 62 

<211> 496 

<212> DNA 

<213> Homo sapiens 



<400> 62 

tcaaacattc cggccactgc gagaccgtac aggccctaca tcattcgagg aatggcgccc 60
ccgacgacgc cctgcagcac cgacgtgtgt gacagcgact acagcgccag ccgctggaag 120
gccagcaagt actacctgga tttgaactcg gactcagacc cctatccacc cccacccacg 180
ccccacagcc agtacctgtc ggcggaggac agctgcccgc cctcgcccgc caccgagagg 240
agctacttcc atctcttccc gccccctccg tccccctgca cggactcatc ctgacctcgg 300
ccgggccact ctggcttctc tgtgcccctg taaatagttt taaatatgaa caaagaaaaa 360
aatatatttt atgatttaaa aaataaatat aattgggatt ttaaaaacat gagaaatgtg 420
aactgtgatg gggtgggcag ggctgggaga actttgtaca gtggagaaat atttataaac 480
ttaattttgt aaaaca 496



